Abstract

We show that for all sufficiently large $d$, the uniform proper 3-coloring model
(in physics called the 3-state antiferromagnetic Potts model at zero temperature)
on $\mathbb{Z}^d$ admits multiple maximal-entropy Gibbs measures. This is a consequence
of the following combinatorial result: if a proper 3-coloring is chosen uniformly
from a box in $\mathbb{Z}^d$, conditioned on color 0 being given to all the vertices on the
boundary of the box which are at an odd distance from a fixed vertex $v$ in the
box, then the probability that $v$ gets color 0 is exponentially small in $d$.

The proof proceeds through an analysis of a certain type of cutset separating
$v$ from the boundary of the box, and builds on techniques developed by Galvin
and Kahn in their proof of phase transition in the hard-core model on $\mathbb{Z}^d$.

Building further on these techniques, we study local Markov chains for sampling
proper 3-colorings of the discrete torus $\mathbb{Z}_n^d$. We show that there is a constant
$\rho \approx 0.22$ such that for all even $n \ge 4$ and $d$ sufficiently large, if $\mathcal{M}$ is a Markov
chain on the set of proper 3-colorings of $\mathbb{Z}_n^d$ that updates the color of at most
$\rho n^d$ vertices at each step and whose stationary distribution is uniform, then the
mixing time of $\mathcal{M}$ (the time taken for $\mathcal{M}$ to reach a distribution that is close to
uniform, starting from an arbitrary coloring) is essentially exponential in $n^{d-1}$.



1 Introduction 

A (proper) $q$-coloring of a graph $G = (V, E)$ is a function $\chi : V(G) \to [q]$ satisfying
$\chi(u) \neq \chi(v)$ whenever $uv \in E$, where we use the notation $[q] = \{0, \ldots, q-1\}$. In

*Department of Mathematics, University of Notre Dame, Notre Dame IN, USA; dgalvin1@nd.edu.

†Department of Mathematics, Rutgers University, New Brunswick NJ, USA;
jkahn@math.rutgers.edu.

‡School of Computer Science, Georgia Institute of Technology, Atlanta GA, USA;
randall@cc.gatech.edu.

§Departments of Management and Mathematics, London School of Economics, London, England;
g.b.sorkin@lse.ac.uk.






the language of statistical physics, a $q$-coloring of $G$ is a configuration in the
zero-temperature $q$-state antiferromagnetic Potts model on $G$ [29]. This is a simple model of
the occupation of space by a collection of $q$ types of particles: the vertices of $G$ represent
sites, each occupied by exactly one particle, and the edges of $G$ represent pairs of sites
that are bonded (by spatial proximity, for example) and cannot be occupied by particles
of the same type. We write $C_q(G)$, or simply $C_q$, for the set of $q$-colorings of $G$.

A basic question concerning $C_q$ is, what does a typical (uniformly chosen) element
look like? For finite $G$, uniform measure on $C_q$ is unambiguous. For infinite $G$, the
standard approach to defining uniform measure on $C_q$ is through the notion of a Gibbs
measure, which, roughly speaking, is a measure on $C_q$ whose restriction to any finite
subset of $V$ is uniform.

Formally, for finite $W \subseteq V$ let $\pi_{W^+}$ be uniform measure on (the $q$-colorings of) the subgraph of $G$
induced by $W \cup \partial_{ext}W$, where $\partial_{ext}W$ is the set of vertices outside $W$ that are adjacent
to something in $W$. Let $\mu$ be a measure on $(C_q, \mathcal{F}_{cyl})$, where $\mathcal{F}_{cyl}$ is the $\sigma$-algebra
generated by the cylinder events $\{\chi(v) = i\}$ for $v \in V$ and $i \in [q]$. We say that $\mu$ is
a Gibbs measure (with uniform specification) if the following condition holds: for all
finite $W \subseteq V$ and $\mu$-almost-all $\chi \in C_q$, the probability that $\chi'$ agrees with $\chi$ on $W$
given that it agrees with $\chi$ off $W$, with $\chi'$ drawn according to $\mu$, is the same as the
probability that $\chi'$ agrees with $\chi$ on $W$ given that it agrees with $\chi$ on $\partial_{ext}W$, with $\chi'$
drawn according to $\pi_{W^+}$. (See e.g. [17] for a thorough treatment of this topic.)

General compactness arguments show that an infinite graph G admits at least one 
Gibbs measure. A simple recipe for producing one is the following. For $\chi \in C_q = C_q(G)$
and $W \subseteq V$, let $C^\chi(W)$ be the set of colorings that agree with $\chi$ off $W$. Fix $\chi \in C_q$ and
a nested sequence $(W_i)_{i=1}^{\infty}$ of finite subsets of $V$ satisfying $\cup_i W_i = V$. For each $i$ let $\mu_i^\chi$
be the (finitely supported) uniform measure on $C^\chi(W_i)$. Any (weak) subsequential limit
of the $\mu_i^\chi$'s (and by compactness there must be at least one such) is a Gibbs measure.
This fact was originally proved, in a much more general context, by Dobrushin [8]; see 
e.g. lU Theorem 3.5] for a simple proof in the present context. 

A central concern in statistical physics (again see [17] for a thorough discussion)
is understanding when a particular system (in our case the $q$-coloring model)
exhibits phase coexistence (a.k.a. phase transition) on a given infinite G, meaning that 
it admits more than one Gibbs measure. Actually, as we explain below, what we are 
really interested in is whether there are multiple Gibbs measures that are all substantial 
in an appropriate sense. 

Our particular concern here is with $G = \mathbb{Z}^d$, the usual nearest-neighbor graph on
the $d$-dimensional integer lattice. This is a bipartite graph, with bipartition classes
$\mathcal{E}$ (the even vertices, the set of lattice points the sum of whose coordinates is even)
and $\mathcal{O}$ (the odd vertices). We will also use $\mathcal{E}$ and $\mathcal{O}$ for induced partition classes of
subgraphs of $\mathbb{Z}^d$. Intuition suggests that, for large $d$, the set of 3-colorings of $\mathbb{Z}^d$ should
mainly consist of six classes, each identified by a predominance of one of the colors on
one of $\mathcal{E}$, $\mathcal{O}$, with the other two colors mainly assigned to the other partition class;
thus (again for large enough $d$) the set of Gibbs measures should include six distinct
measures corresponding to these classes. A well-known conjecture that this is the case
goes back at least to R. Kotecký circa 1985 ([23]; see e.g. [22] for context), although
the explicit conjecture seems not to have appeared in print.

Our first main result verifies Kotecký's conjecture. To state the result precisely, we
set up some notation. Let $\chi^{(0,\mathcal{O})} \in C_3$ be any 3-coloring of $\mathbb{Z}^d$ satisfying $\chi|_{\mathcal{O}} \equiv 0$.
For each $n \in \mathbb{N}$, let $W_n$ consist of the box $\{-n, \ldots, n\}^d$ together with all of the odd
vertices of the box $\{-(n+1), \ldots, n+1\}^d$. Let $v \in \mathcal{E}$ and $w \in \mathcal{O}$ be fixed vertices of
$\mathbb{Z}^d$. Let $\mu^{(0,\mathcal{O})}$ be any subsequential limit of the $\mu_n^{(0,\mathcal{O})}$'s.

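Theorem 1.1 With notation as above,

$$\mu^{(0,\mathcal{O})}(\chi(v) = m) \ \begin{cases} \le e^{-\Omega(d)} & \text{if } m = 0 \\ \ge 1/2 - e^{-\Omega(d)} & \text{if } m \in \{1,2\} \end{cases}$$

and

$$\mu^{(0,\mathcal{O})}(\chi(w) = m) \ \begin{cases} \ge 1/2 - e^{-\Omega(d)} & \text{if } m = 0 \\ \le 1/4 + e^{-\Omega(d)} & \text{if } m \in \{1,2\}. \end{cases}$$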



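This immediately implies that $\mu^{(0,\mathcal{O})}$ together with $\mu^{(1,\mathcal{O})}$, $\mu^{(2,\mathcal{O})}$, $\mu^{(0,\mathcal{E})}$, $\mu^{(1,\mathcal{E})}$ and $\mu^{(2,\mathcal{E})}$
(all defined in the obvious way) form a collection of six distinct Gibbs measures for all
sufficiently large $d$.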

It is possible for a Gibbs measure to be trivial. For example, if $\chi \in C_3(\mathbb{Z}^2)$ is the
mod 3 coloring (satisfying $\chi((x, y)) = x + y \pmod 3$) and $W_i$ is the $\ell_\infty$ ball of radius $i$,
then it is straightforward to check that the only coloring that agrees with $\chi$ off $W_i$ is
$\chi$ itself, and so the $\mu_i^\chi$'s in this case have as their unique limit the Gibbs measure with
support $\{\chi\}$. (See e.g. [5] for other examples of such "frozen" Gibbs measures for the
$q$-coloring model on the infinite regular tree.) These trivialities are avoided if we focus
on Gibbs measures of maximal entropy (essentially measures with substantial support;
see Section 5 for a precise definition). Kotecký's conjecture as originally told to us [23]
was that the 3-coloring model in high dimension admits multiple Gibbs measures of
maximal entropy.

Theorem 1.2 The Gibbs measure $\mu^{(0,\mathcal{O})}$ constructed above is a measure of maximal
entropy.

At present our methods do not extend beyond $q = 3$, but we strongly believe that
the phenomenon of phase coexistence for the $q$-coloring model on $\mathbb{Z}^d$ occurs for all
$q > 3$. A resolution of the following conjecture would be of great interest in both the
statistical physics and discrete probability communities.

Conjecture 1.3 For all $q > 3$ and all sufficiently large $d = d(q)$, there is more than
one Gibbs measure of maximal entropy for the $q$-coloring model on $\mathbb{Z}^d$.

The natural expectation is that for odd $q$ there are at least $2\binom{q}{\lfloor q/2 \rfloor}$ such measures
and for even $q$ at least $\binom{q}{q/2}$ such, corresponding to choices of a partition $[q] = A \cup B$
with $|A| = \lfloor q/2 \rfloor$ and a partition class of $\mathbb{Z}^d$ on which colors from $A$ are preferred.
(Note that the issue here is only the analog of Theorem 1.1; Theorem 1.2 extends
without difficulty.) The analogous statement for proper $q$-colorings of the Hamming
cube $\{0, 1\}^d$ was proved in [16].
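The count in the preceding paragraph can be checked with a few lines of Python. This is only an illustrative sketch: the function name `expected_measures` is hypothetical, and the formula is the one stated above (choose a color class $A$ of size $\lfloor q/2 \rfloor$ and a partition class of the lattice, identifying the two equivalent choices when $q$ is even).

```python
from math import comb

def expected_measures(q: int) -> int:
    """Conjectured lower bound on the number of maximal-entropy Gibbs measures.

    Odd q:  2 * C(q, floor(q/2))  (choice of A and of a partition class).
    Even q: C(q, q/2)             (A on the even class equals B on the odd class).
    """
    half = q // 2
    if q % 2 == 1:
        return 2 * comb(q, half)
    return comb(q, half)

# For q = 3 this recovers the six measures discussed for the 3-coloring model.
print(expected_measures(3))
```

For $q = 3$ the value is $2\binom{3}{1} = 6$, matching the six measures of Theorem 1.1.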




In this paper we also consider the problem of using Markov chains to sample uniformly
at random from the set $C_q(G)$, for finite $G$. Sampling and counting colorings of
a graph are fundamental problems in computer science and discrete mathematics. One 
approach is to design a Markov chain whose stationary distribution is uniform over 
the set of colorings of G. Then, starting from an arbitrary coloring and simulating a 
random walk according to this chain for a sufficient number of steps, we get a sample 
from a distribution which is close to uniform. The number of steps required for the 
distribution to get close to uniform is referred to as the mixing time (see e.g. [22]). The 
chain is called rapidly mixing if the mixing time is polynomial in $|V|$ (so it converges
quickly to stationarity); it is torpidly mixing if its mixing time is super-polynomial in
$|V|$ (so it converges slowly). There has been a long history of studying mixing times
of various chains in the context of colorings (see e.g. [H [TTl [181 [HI fSHl 125]). 

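A particular focus of this study has been on Glauber dynamics. For $q$-colorings this
is any single-site update Markov chain that connects two colorings only if they differ
on at most a single vertex. The Metropolis chain $\mathcal{M}_q$ on state space $C_q$ has transition
probabilities $P_q(\chi_1, \chi_2)$, $\chi_1, \chi_2 \in C_q$, given by

$$P_q(\chi_1, \chi_2) = \begin{cases} 0 & \text{if } |\{v \in V : \chi_1(v) \neq \chi_2(v)\}| > 1; \\ \frac{1}{q|V|} & \text{if } |\{v \in V : \chi_1(v) \neq \chi_2(v)\}| = 1; \\ 1 - \sum_{\chi_1 \neq \chi_2' \in C_q} P_q(\chi_1, \chi_2') & \text{if } \chi_1 = \chi_2. \end{cases}$$

We may think of $\mathcal{M}_q$ dynamically as follows. From a $q$-coloring $\chi$, choose a vertex $v$
uniformly from $V$ and a color $j$ uniformly from $[q]$. Then recolor $v$ with color $j$ if the
result is a (proper) $q$-coloring; otherwise stay at $\chi$.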
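The dynamical description above translates directly into a transition matrix for small graphs. The following is a minimal sketch, not the authors' code: the function names and the brute-force enumeration are illustrative assumptions, and it is only practical for tiny graphs such as a 4-cycle.

```python
from itertools import product

def proper_colorings(edges, num_vertices, q):
    """All proper q-colorings, as tuples indexed by vertex 0..num_vertices-1."""
    return [c for c in product(range(q), repeat=num_vertices)
            if all(c[u] != c[v] for u, v in edges)]

def metropolis_matrix(edges, num_vertices, q):
    """Transition matrix of the single-site chain described in the text.

    A step picks a vertex and a color uniformly at random and recolors the
    vertex if the result is proper; otherwise the chain stays put.
    """
    states = proper_colorings(edges, num_vertices, q)
    index = {c: i for i, c in enumerate(states)}
    size = len(states)
    P = [[0.0] * size for _ in range(size)]
    move = 1.0 / (q * num_vertices)
    for c in states:
        i = index[c]
        for v in range(num_vertices):
            for j in range(q):
                c2 = c[:v] + (j,) + c[v + 1:]
                if c2 != c and c2 in index:
                    P[i][index[c2]] += move
        # Remaining probability mass is the holding probability.
        P[i][i] = 1.0 - sum(P[i])
    return states, P

# Example: proper 3-colorings of the 4-cycle.
cycle_edges = [(0, 1), (1, 2), (2, 3), (3, 0)]
states, P = metropolis_matrix(cycle_edges, 4, 3)
```

Because the matrix is symmetric, the uniform distribution is stationary whenever the chain is ergodic, which is the property used in the discussion that follows.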

When $\mathcal{M}_q$ is ergodic, its stationary distribution $\pi_q$ is uniform over $q$-colorings. A
series of recent papers has shown that $\mathcal{M}_q$ is rapidly mixing provided the number of
colors is sufficiently large compared to the maximum degree (see [11] and the references
therein). Substantially less is known when the number of colors is small. In fact, for $q$
small it is NP-complete to decide whether a graph admits even one $q$-coloring (see e.g.



In this paper we consider the mixing rate of $\mathcal{M}_q$ on rectangular regions of $\mathbb{Z}^d$. It
is known [25] that for $q \ge 3$, Glauber dynamics is ergodic (connects the state space of
$q$-colorings) on any such lattice region. In $\mathbb{Z}^2$ much is known about the mixing rate
of $\mathcal{M}_q$. Randall and Tetali [30], building on work of Luby et al. [22], showed that
Glauber dynamics for sampling 3-colorings is rapidly mixing on any finite, simply-connected
subregion of $\mathbb{Z}^2$ when the colors on the boundary of the region are fixed.
Goldberg et al. [18] subsequently showed that the chain remains fast on rectangular
regions without this boundary restriction. Substantially more is known when there
are many colors: Jerrum [20] showed that Glauber dynamics is rapidly mixing on any
graph satisfying $q > 2\Delta$, where $q$ is the number of colors and $\Delta$ is the maximum degree,





thus showing Glauber dynamics is fast on $\mathbb{Z}^2$ when $q > 8$. It has since been shown that
it is fast for $q \ge 6$ [HE]. Surprisingly, the efficiency remains unresolved for $q = 4$ or 5.

In higher dimensions much less is known when $q$ is small. The belief among physicists
working in the field is that Glauber dynamics on 3-colorings is torpidly mixing
when the dimension $d$ of the cubic lattice is large enough (see e.g. the discussions in
[31], [35]), but there are no rigorous results. Here, we obtain the first such rigorous
result by proving torpid mixing of the chain on cubic lattices with periodic boundary
conditions.

Formally, we consider 3-colorings of the even discrete torus $\mathbb{Z}_n^d$. This is the graph
on vertex set $[n]^d$ (with $n$ even) with edge set consisting of those pairs of vertices that
differ on exactly one coordinate and differ by 1 (mod $n$) on that coordinate. For a
Markov chain $\mathcal{M}$ on $C_3 = C_3(\mathbb{Z}_n^d)$ we denote by $\tau_{\mathcal{M}}$ the mixing time of the chain (see
Section 3 for a precise definition). We prove the following.

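Theorem 1.4 There is a constant $d_0 > 0$ for which the following holds. For $d \ge d_0$
and $n \ge 4$ even, the Glauber dynamics chain $\mathcal{M}_3$ on $C_3$ satisfies

$$\tau_{\mathcal{M}_3} \ge \exp\left\{ \frac{n^{d-1}}{d^4 \log^2 n} \right\}.$$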

When $n = 2$, $\mathbb{Z}_n^d$ becomes the Hamming cube $\{0, 1\}^d$. Slow mixing of Glauber dynamics
for sampling 3-colorings was proved in this case in [12]. As with the case of phase
coexistence, we strongly believe that torpid mixing holds for all $q > 3$ as well, as long
as the dimension is sufficiently high.

Conjecture 1.5 For all $q > 3$, all even $n > 2$ and all sufficiently large $d = d(q)$, the
mixing time of the Glauber dynamics chain $\mathcal{M}_q$ on $C_q$ is (essentially) exponential in
$n^{d-1}$.


Our techniques actually apply to a more general class of chains. A Markov chain
$\mathcal{M}$ on state space $C_3$ is said to be $\rho$-local if, in each step of the chain, at most $\rho|V|$
vertices have their colors changed; that is, if

$$P_{\mathcal{M}}(\chi_1, \chi_2) \neq 0 \implies |\{v \in V : \chi_1(v) \neq \chi_2(v)\}| \le \rho|V|.$$

These types of chains were introduced in [9], where the terminology $\rho|V|$-cautious was
employed. We prove the following, which easily implies Theorem 1.4.

Theorem 1.6 Fix $\rho > 0$ satisfying $H(\rho) + \rho < 1$. There is a constant $d_0 = d_0(\rho) > 0$
for which the following holds. For $d \ge d_0$ and $n \ge 4$ even, if $\mathcal{M}$ is an ergodic $\rho$-local
Markov chain on $C_3$ with uniform stationary distribution then

$$\tau_{\mathcal{M}} \ge \exp\left\{ \frac{n^{d-1}}{d^4 \log^2 n} \right\}.$$

Here $H(x) = -x \log x - (1 - x) \log(1 - x)$ is the usual binary entropy function. Note
that all $\rho \le 0.22$ satisfy $H(\rho) + \rho < 1$.
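The numerical claim about $\rho$ is easy to check. The sketch below assumes the logarithm in the binary entropy function is taken to base 2 (the usual convention for binary entropy); the function names are illustrative only.

```python
import math

def binary_entropy(x: float) -> float:
    """H(x) = -x log2 x - (1 - x) log2 (1 - x), with H(0) = H(1) = 0."""
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return -x * math.log2(x) - (1.0 - x) * math.log2(1.0 - x)

def is_admissible(rho: float) -> bool:
    """Check the hypothesis H(rho) + rho < 1 of the theorem on rho-local chains."""
    return binary_entropy(rho) + rho < 1.0

for rho in (0.10, 0.20, 0.22, 0.23):
    print(rho, round(binary_entropy(rho) + rho, 4), is_admissible(rho))
```

With base-2 logarithms, $H(0.22) + 0.22 \approx 0.980 < 1$ while $H(0.23) + 0.23 \approx 1.008 > 1$, so the threshold lies between 0.22 and 0.23.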

We show phase transition using a Peierls argument, to be discussed in detail in
Section 2.3. We show torpid mixing via a conductance argument by identifying a cut
in the state space requiring exponential time to cross. For both results, our work builds
heavily on technical machinery introduced by Galvin and Kahn [14] showing that the
hard-core (independent set) model on $\mathbb{Z}^d$ exhibits phase transition for some values of
the density parameter $\lambda$ that go to zero as the dimension grows. Specifically, for $\lambda > 0$,
choose $\mathbf{I}$ from $\mathcal{I}(\Lambda_n)$ (the set of independent sets of the box $\Lambda_n = \{-n, \ldots, n\}^d$) with
$\Pr(\mathbf{I} = I) \propto \lambda^{|I|}$. Galvin and Kahn showed that for $\lambda > C d^{-1/4} \log^{3/4} d$ (for some
constant $C$) and fixed $v \in \mathcal{E}$,

$$\lim_{n \to \infty} P(v \in \mathbf{I} \mid \mathbf{I} \supseteq \partial_{int}\Lambda_n \cap \mathcal{E}) \ge \frac{(1 + o(1))\lambda}{1 + \lambda}$$

whereas

$$\lim_{n \to \infty} P(v \in \mathbf{I} \mid \mathbf{I} \supseteq \partial_{int}\Lambda_n \cap \mathcal{O}) \le (1 + \lambda)^{-2d(1 - o(1))},$$

where $\partial_{int}\Lambda_n$ is the set of vertices in $\Lambda_n$ that are adjacent (in $\mathbb{Z}^d$) to something outside
$\Lambda_n$. In other words, the influence of the boundary on the center of a large box persists
as the boundary recedes.

Neither the results of [14] (showing phase coexistence for the hard-core model on
$\mathbb{Z}^d$) nor Theorem 1.1 (concerning 3-colorings of $\mathbb{Z}^d$) directly imply anything about
the behavior of Markov chains on finite lattice regions. However, they do suggest
that in the finite setting, typical configurations fall into the distinct classes described
in stationarity and that local Markov chains will be unlikely to move between these
classes; the remaining configurations are expected to have negligible weight for large
lattice regions, even when they are finite.

Galvin [13] extended the results of [14], showing that in sufficiently high dimension,
Glauber dynamics on independent sets mixes torpidly in rectangular regions of
$\mathbb{Z}^d$ with periodic boundary conditions. Similar results were known previously about
independent sets; however, one significant new contribution of [13] was showing that
as $d$ increases, the critical $\lambda$ above which Glauber dynamics mixes torpidly tends to 0.
In particular, there is some dimension $d_0$ such that for all $d \ge d_0$, Glauber dynamics
will be torpid on $\mathbb{Z}^d$ when $\lambda = 1$. This turns out to be the crucial new ingredient
allowing us to rigorously verify phase transition and torpid mixing for 3-colorings in
high dimensions, as there turns out to be a close connection between the independent
set model at $\lambda = 1$ and the 3-coloring model. Unlike many statistical physics models,
the 3-coloring model does not come equipped with a parameter such as $\lambda$ that can
be tweaked to establish desired bounds; this makes the proofs here significantly more
delicate than the usual phase-transition and torpid-mixing arguments.

The rest of the paper is laid out as follows. In Section 2 we give the proof of Theorem
1.1 (phase transition), modulo one of our two main technical lemmas, Lemma 2.3. This
section also provides an overview of our proof strategy (Section 2.3). In Section 3 we
give the proof of Theorem 1.6 (torpid mixing), modulo the second main technical
lemma, Lemma 3.2. Section 4 provides the proofs of Lemmas 2.3 and 3.2, while in
Section 5 we prove Theorem 1.2 (measures of maximal entropy).

The original aim of this work was to prove Theorem 1.1 (phase coexistence). We
achieved this at a 2002 Newton Institute programme,* and the second author discussed
the result in talks and in communications with R. Kotecký and others. We then noticed
that with some additional work we could obtain a proof of Theorem 1.6 (torpid mixing).
This result was presented by the first and third authors in [15], which also includes the
first mention of Theorem 1.1 in print. During preparation of the present manuscript
we learned from Ron Peled of his recent [27], whose main result contains Theorem 1.1
(of which he heard from Kotecký only after proving his result [28]). Though similar
in spirit, the approach of [27], which exploits a correspondence between colorings and
height functions, is different from the present argument, which stays within the world
of colorings.



2 Proof of Theorem 1.1 



In this section we show that the 3-coloring model on $\mathbb{Z}^d$ admits multiple Gibbs measures
for all sufficiently large $d$ (Theorem 1.1).

2.1 Some notation 

Let $\Sigma = (V, E)$ be a bipartite graph with bipartition classes $\mathcal{E}$ and $\mathcal{O}$. For $X \subseteq V$,
write $\nabla(X)$ for the set of edges in $E$ that have one end in $X$ and one end outside $X$;
$\overline{X}$ for $V \setminus X$; $\partial_{int}X$ for the set of vertices in $X$ that are adjacent to something outside
$X$; $\partial_{ext}X$ for the set of vertices outside $X$ that are adjacent to something in $X$; $X^+$ for
$X \cup \partial_{ext}X$; $X^{\mathcal{E}}$ for $X \cap \mathcal{E}$ and $X^{\mathcal{O}}$ for $X \cap \mathcal{O}$. Further, for $x \in V$ set $\partial x = \partial_{ext}\{x\}$. We
abuse notation slightly, identifying sets of vertices of $V$ and the subgraphs they induce.

2.2 Finitizing Theorem 1.1

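Theorem 1.1 may be finitized as follows. Set $\Lambda = \Lambda_n = \{-n, \ldots, n\}^d$. (Throughout, $n$
will be fixed, so we drop the dependence in the notation.) Set

$$C_3^{\mathcal{O}} = \{\chi \in C_3(\Lambda) : \chi|_{\partial_{int}\Lambda \cap \mathcal{O}} \equiv 0\}$$

and for $v_0 \in (\Lambda \setminus \partial_{int}\Lambda) \cap \mathcal{E}$ set

$$C_3^{\mathcal{O}}(v_0) = \{\chi \in C_3^{\mathcal{O}} : \chi(v_0) = 0\}.$$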



* Isaac Newton Institute for Mathematical Sciences programme on Computation, Combinatorics
and Probability, 29 Jul-20 Dec 2002, http://www.newton.ac.uk/programmes/CMP/.
In other words, $C_3^{\mathcal{O}}$ is the subset of (proper) 3-colorings of $\Lambda$ in which all of the odd
vertices on the boundary get color 0, while $C_3^{\mathcal{O}}(v_0)$ is the set of those colorings in which
the even vertex $v_0$ also gets color 0. We prove the following.

Theorem 2.1 For all $n$,

$$\frac{|C_3^{\mathcal{O}}(v_0)|}{|C_3^{\mathcal{O}}|} \le e^{-\Omega(d)}$$

as $d \to \infty$ (with the implicit constant independent of $n$).
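For very small boxes the ratio in Theorem 2.1 can be computed by brute force, which may help in getting a feel for the two sets involved. The following is an illustrative sketch only (the function name `count_colorings` and the backtracking enumeration are assumptions made here, and the theorem itself concerns large $d$, far beyond the reach of enumeration): it fixes color 0 on the odd boundary vertices of $\{-n, \ldots, n\}^d$, counts all proper 3-colorings, and counts those in which the origin also receives color 0.

```python
from itertools import product

def count_colorings(n, d=2):
    """Return (#colorings with origin colored 0, #all colorings) for the box
    {-n,...,n}^d with every odd boundary vertex fixed to color 0."""
    box = list(product(range(-n, n + 1), repeat=d))

    def nbrs(v):
        for i in range(d):
            for s in (-1, 1):
                if abs(v[i] + s) <= n:
                    yield v[:i] + (v[i] + s,) + v[i + 1:]

    # Odd vertices on the inner boundary of the box are pinned to color 0.
    colour = {v: 0 for v in box
              if sum(v) % 2 == 1 and max(abs(x) for x in v) == n}
    free = [v for v in box if v not in colour]
    origin = (0,) * d
    total = zero_at_origin = 0

    def extend(k):
        nonlocal total, zero_at_origin
        if k == len(free):
            total += 1
            zero_at_origin += (colour[origin] == 0)
            return
        v = free[k]
        for c in range(3):
            # Uncoloured neighbours return None, which never equals c.
            if all(colour.get(u) != c for u in nbrs(v)):
                colour[v] = c
                extend(k + 1)
                del colour[v]

    extend(0)
    return zero_at_origin, total

zero, total = count_colorings(2, d=2)
print(zero, total, zero / total)
```

In the degenerate case $n = 1$, $d = 2$ every neighbor of the origin is an odd boundary vertex, so the origin can never receive color 0 and the ratio is 0.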

With some extra work we could replace $e^{-\Omega(d)}$ here with $2^{-2d(1 - o(1))}$. This would require
dealing more carefully with small $c_0$ in Lemma 2.3, and to simplify the presentation
we chose not to do this. The interested reader may consult [14] (and in particular the
end of Section 2.13 of that reference) for the approach.

Theorem 2.1 implies Theorem 1.1. Indeed, let $\mu^{(0,\mathcal{O})}$ be any subsequential limit of
the $\mu_n^{(0,\mathcal{O})}$'s, where the notation is as in the discussion before the statement of Theorem
1.1. From Theorem 2.1 we immediately have $\mu^{(0,\mathcal{O})}(\chi(v) = 0) \le e^{-\Omega(d)}$ and so, by
symmetry, $\mu^{(0,\mathcal{O})}(\chi(v) = 1) = \mu^{(0,\mathcal{O})}(\chi(v) = 2) \ge 1/2 - e^{-\Omega(d)}$. A second application of
Theorem 2.1 (together with a union bound) shows that, for $Q = \{\chi(v') \neq 0 \ \forall v' \sim w\}$,
$\mu^{(0,\mathcal{O})}(Q) \ge 1 - e^{-\Omega(d)}$, whence

$$\mu^{(0,\mathcal{O})}(\chi(w) = 0) \ge \mu^{(0,\mathcal{O})}(Q)\, \mu^{(0,\mathcal{O})}(\chi(w) = 0 \mid Q) \ge (1 - e^{-\Omega(d)})/2;$$

and then symmetry gives the second inequality in Theorem 1.1.

2.3 Preview 

For a generic $\chi \in C_3^{\mathcal{O}}(v_0)$ there is a region of $\Lambda$ around $v_0$ consisting predominantly
of even vertices colored 0 together with their neighbors, and a region around $\partial_{int}\Lambda$
consisting of odd vertices colored 0 together with their neighbors. These regions are
separated by a two-layer 0-free moat or cutset. In Section 2.4 we describe a procedure
that associates a particular such cutset with each $\chi \in C_3^{\mathcal{O}}(v_0)$. Our main technical
result, Lemma 2.3, asserts that for each possible cutset size $c$, the probability that the
cutset associated with a uniformly chosen coloring has size $c$ is exponentially small in
$c$. This lemma is presented in Section 2.5, where it is also used to derive Theorem 2.1.
We use a variant of the Peierls argument (originally presented in [26]) to prove
Lemma 2.3. By carefully modifying $\chi \in C_3^{\mathcal{O}}(v_0)$ inside its cutset, we can exploit the
fact that the cutset is 0-free to map $\chi$ to a set $\varphi(\chi)$ of many different $\chi' \in C_3^{\mathcal{O}}$. If the
$\varphi(\chi)$'s were disjoint for distinct $\chi$'s, we would be done, having shown that there are
many more 3-colorings in $C_3^{\mathcal{O}}$ than in $C_3^{\mathcal{O}}(v_0)$. To control the possible overlap, we define
a flow $\nu : C_3^{\mathcal{O}}(v_0) \times C_3^{\mathcal{O}} \to [0, \infty)$ supported on pairs $(\chi, \chi')$ with $\chi' \in \varphi(\chi)$ in such
a way that the flow out of each $\chi \in C_3^{\mathcal{O}}(v_0)$ is 1. Any uniform bound we can obtain
on the flow into elements of $C_3^{\mathcal{O}}$ is then easily seen to be a bound on $|C_3^{\mathcal{O}}(v_0)|/|C_3^{\mathcal{O}}|$.
We define the flow via a notion of approximation modified from [14]. To each cutset
$\gamma$ we associate a set $A(\gamma)$ that approximates the interior of $\gamma$ in a precise sense, in
such a way that as we run over all possible $\gamma$, the total number of approximate sets
used is small. Then for each $\chi' \in C_3^{\mathcal{O}}$ and each approximation $A$, we consider the set
of those $\chi \in C_3^{\mathcal{O}}(v_0)$ with $\chi' \in \varphi(\chi)$ and with $A$ the approximation to $\gamma$. We define
the flow so that if this set is large, then $\nu(\chi, \chi')$ is small for each $\chi$ in the set. In this
way we control the flow into $\chi'$ corresponding to each approximation $A$; since the total
number of approximations is small, we control the total flow into $\chi'$. In the language
of statistical physics, this approximation scheme is a coarse-graining argument. The
details appear in Section 4.
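The step "easily seen to be a bound" in the flow argument is a double count. As a sketch, suppose (the symbol $M$ is introduced here only for illustration) that the flow out of every $\chi \in C_3^{\mathcal{O}}(v_0)$ equals 1 and that the flow into every $\chi' \in C_3^{\mathcal{O}}$ is at most $M$. Then summing the flow in two ways gives:

```latex
\begin{align*}
|C_3^{\mathcal{O}}(v_0)|
  &= \sum_{\chi \in C_3^{\mathcal{O}}(v_0)} \; \sum_{\chi' \in C_3^{\mathcal{O}}} \nu(\chi,\chi')
     && \text{(flow out of each $\chi$ is 1)} \\
  &= \sum_{\chi' \in C_3^{\mathcal{O}}} \; \sum_{\chi \in C_3^{\mathcal{O}}(v_0)} \nu(\chi,\chi')
     && \text{(exchange the order of summation)} \\
  &\le M \, |C_3^{\mathcal{O}}|
     && \text{(flow into each $\chi'$ is at most $M$)},
\end{align*}
```

so that $|C_3^{\mathcal{O}}(v_0)| / |C_3^{\mathcal{O}}| \le M$.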

The main result of [14] is proved along similar lines to those described above. One
of the difficulties we encounter in moving from these arguments on independent sets
to arguments on colorings is that of finding an analogous way of modifying a coloring
inside a cutset in order to exploit the fact that it is 0-free. The beginning of Section
4 (in particular Claims 4.1 and 4.2) describes an appropriate modification that has all
the properties we desire.

2.4 Cutsets 

We now describe a way of associating with each $\chi \in C_3^{\mathcal{O}}(v_0)$ a minimal edge cutset,
following an approach of [3] and [13]. (An alternate construction is given in [14]. The
present construction is perhaps more transparent.)

Given $\chi \in C_3^{\mathcal{O}}(v_0)$ set $I = I(\chi) = \chi^{-1}(0)$. Note that $I$ is an independent set (a set of
vertices no two of which are adjacent). Let $R$ be the component of $(I^{\mathcal{E}})^+$ that includes
$v_0$. Let $C$ be the component of $\overline{R}$ that includes $\partial_{int}\Lambda$. Set $\gamma = \gamma(\chi) = \nabla(C)$ and
$W = W(\chi) = \overline{C}$. Evidently $C$ is connected, and $W$ consists of $R$, which is connected,
together with a number of other components of $\overline{R}$, each of which is joined to $R$; so $W$
is also connected. It follows that $\gamma$ is a minimal edge-cutset in $\Lambda$, separating $v_0$ from
$\partial_{int}\Lambda$. Note that $\gamma$ depends only on the independent set $I$. Note also that the vertex
set of $\gamma$ is $\partial_{int}W \cup \partial_{ext}W$. We write $|\gamma|$ for the size (number of edges) of $\gamma$.
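The construction just described can be carried out mechanically on a small box. The sketch below is illustrative only: the helper names (`neighbors`, `component`, `cutset`) and the example coloring are assumptions made here, and the code presumes the input coloring fixes color 0 on the odd boundary vertices and at $v_0$, as in the text.

```python
from itertools import product

def neighbors(v, n):
    """Neighbors of v inside the box {-n,...,n}^d."""
    for i in range(len(v)):
        for s in (-1, 1):
            if abs(v[i] + s) <= n:
                yield v[:i] + (v[i] + s,) + v[i + 1:]

def component(start, allowed, n):
    """Vertices reachable from `start` using only vertices in `allowed`."""
    seen, stack = {start}, [start]
    while stack:
        v = stack.pop()
        for u in neighbors(v, n):
            if u in allowed and u not in seen:
                seen.add(u)
                stack.append(u)
    return seen

def cutset(chi, v0, n):
    """Return (W, gamma) for a coloring chi given as a dict vertex -> color."""
    box = set(chi)
    even_zeros = {v for v in box if chi[v] == 0 and sum(v) % 2 == 0}
    # (I^E)^+ : even vertices colored 0 together with their neighbors.
    closure = even_zeros | {u for v in even_zeros for u in neighbors(v, n)}
    R = component(v0, closure, n)
    # C: the component of the complement of R containing the inner boundary.
    boundary_vertex = next(v for v in box if max(abs(x) for x in v) == n)
    C = component(boundary_vertex, box - R, n)
    W = box - C
    gamma = {(w, u) for w in W for u in neighbors(w, n) if u in C}
    return W, gamma

# Example in d = 2, n = 2: origin colored 0, interior odd vertices colored 1,
# odd boundary vertices colored 0, all remaining even vertices colored 2.
n = 2
chi = {}
for v in product(range(-n, n + 1), repeat=2):
    odd = sum(v) % 2 == 1
    on_boundary = max(abs(x) for x in v) == n
    if v == (0, 0):
        chi[v] = 0
    elif odd:
        chi[v] = 0 if on_boundary else 1
    else:
        chi[v] = 2

W, gamma = cutset(chi, (0, 0), n)
print(len(W), len(gamma))
```

In this example $W$ is the origin together with its four neighbors, and the cutset has 12 edges, in agreement with the identity $|\gamma| = 2d(|W^{\mathcal{O}}| - |W^{\mathcal{E}}|) = 4(4 - 1)$ used in the proof of Lemma 2.2.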

The next lemma summarizes the properties of $\gamma$ that we will draw upon in what
follows; having established these properties we will not subsequently refer to the details
of the construction. For the most part these properties will not be used directly, but
will be referred to in order to validate the applications of various results from [14].

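Lemma 2.2 For each $\chi \in C_3^{\mathcal{O}}(v_0)$ we have the following.

$$v_0 \in W \text{ and } \partial_{int}\Lambda \cap W = \emptyset; \qquad (2)$$

$$\partial_{int}W \subseteq \mathcal{O} \text{ and } \partial_{ext}W \subseteq \mathcal{E}; \qquad (3)$$

$$\partial_{int}W \cap I = \emptyset \text{ and } \partial_{ext}W \cap I = \emptyset; \qquad (4)$$

$$\forall x \in \partial_{int}W, \ \partial x \cap W \cap I \neq \emptyset; \qquad (5)$$

$$W^{\mathcal{O}} = \partial_{ext}W^{\mathcal{E}} \text{ and } W^{\mathcal{E}} = \{y \in \mathcal{E} : \partial y \subseteq W^{\mathcal{O}}\} \qquad (6)$$

and

$$\text{for large enough } d, \ |\gamma| \ge \max\{|W|^{1 - 1/d}, d^2\}. \qquad (7)$$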
Proof: That $v_0 \in W$ and $\partial_{int}\Lambda \cap W = \emptyset$ is clear.

Properties (3), (4), (5) and (6) are also easily verified; see [13, Lemma 3.3] or [14,
Proposition 2.6] for detailed proofs.

The isoperimetric inequality of Bollobás and Leader [2, Theorem 3] says that if
$W \subseteq \Lambda$ satisfies $|W| \le |\Lambda|/2$ then $|\nabla(W)| \ge |W|^{1 - 1/d}$. Since $W \cap \partial_{int}\Lambda = \emptyset$ we
may apply this (perhaps with $W$ viewed as a subset of a larger $\Lambda$) to conclude that
$|\gamma| \ge |W|^{1 - 1/d}$. For the second inequality in (7), note that we have $|\gamma| =
2d(|W^{\mathcal{O}}| - |W^{\mathcal{E}}|)$. In [14, Lemma 2.13] it is shown that if $A, B \subseteq \Lambda$ satisfy $A \subseteq \mathcal{E}$,
$B \subseteq \mathcal{O}$, $B = \partial_{ext}A$, $A = \{v \in \mathcal{E} : \partial v \subseteq B\}$, $(A \cup B) \cap \partial_{int}\Lambda = \emptyset$, and $|B| \le d^{O(1)}$ then
$|B| - |A| \ge |B|(1 - O(1/d))$. By (2) and (6), $W^{\mathcal{E}}$ and $W^{\mathcal{O}}$ satisfy these conditions, and
so noting that $|W^{\mathcal{O}}| \ge 2d$, we get $|\gamma| \ge 2d^2(1 - o(1)) \ge d^2$ for $|W| \le d^{O(1)}$; the first
inequality in (7) implies the second for all larger $|W|$. □

The cutsets also satisfy a connectivity property (specifically, that $\partial_{int}W \cup \partial_{ext}W$
induces a connected graph). We will not use this property explicitly in the sequel; it
is an important ingredient in the proof of Lemma 4.3 (a combination of results from
[13] and [14]) where it serves to bound the number of cutsets of a given size that use
a given edge.

2.5 The main lemma for phase transition 

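For $c_0 \in \mathbb{N}$ set

$$\mathcal{W}(c_0, v_0) = \{\gamma : |\gamma| = c_0, \ \gamma = \gamma(\chi) \text{ for some } \chi \in C_3^{\mathcal{O}}(v_0)\}$$

and set $\mathcal{W} = \cup_{c_0} \mathcal{W}(c_0, v_0)$. Set $C_3^{\mathcal{O}}(c_0, v_0) = \{\chi \in C_3^{\mathcal{O}}(v_0) : |\gamma(\chi)| = c_0\}$. The main
technical lemma we need to prove phase transition is the following.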

Lemma 2.3 There are constants $C, d_0 > 0$ such that the following holds. For all
$d \ge d_0$, $n$ and $c_0$,



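We give the proof in Section 4.

From Lemma 2.3 we easily obtain Theorem 2.1. Indeed, for all $n$ and $d \ge d_0$ we
have (using (7) for the restriction on $c_0$)

$$\frac{|C_3^{\mathcal{O}}(v_0)|}{|C_3^{\mathcal{O}}|} = \sum_{c_0 \ge d^2} \frac{|C_3^{\mathcal{O}}(c_0, v_0)|}{|C_3^{\mathcal{O}}|} \le e^{-\Omega(d)}.$$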
3 Proof of Theorem 1.6



The aim of this section is to show that in the finite setting of the discrete torus, Glauber
dynamics for sampling from 3-colorings mixes torpidly (Theorem 1.6). We begin by
formalizing some definitions. Given an ergodic Markov chain $\mathcal{M}$ on state space $\Omega$ with
stationary distribution $\pi$, let $P^t(x, \cdot)$ be the distribution of the chain at time $t$ given
that it started in state $x$. The mixing time $\tau_{\mathcal{M}}$ of $\mathcal{M}$ is defined to be



$$\tau_{\mathcal{M}} = \min\left\{ t_0 : \max_{x \in \Omega} \frac{1}{2} \sum_{y \in \Omega} |P^t(x, y) - \pi(y)| \le \frac{1}{e} \quad \forall t > t_0 \right\}.$$
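For a chain small enough to write down explicitly, the worst-case total variation distance in this definition can be computed by repeated matrix multiplication. The sketch below is illustrative: the function names are hypothetical, the threshold is left as a parameter (the value $1/e$ is assumed here as a common convention), and the function returns the first time at which the distance drops to the threshold, which pins down the mixing time up to the indexing convention since the worst-case distance is non-increasing in $t$.

```python
import math

def mat_mul(A, B):
    """Product of two square matrices given as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]

def worst_case_tv(Pt, pi):
    """max over start states x of (1/2) * sum_y |P^t(x, y) - pi(y)|."""
    return max(0.5 * sum(abs(p - q) for p, q in zip(row, pi)) for row in Pt)

def first_time_below(P, pi, threshold=1 / math.e, t_max=10_000):
    """Smallest t >= 1 with worst-case TV distance <= threshold, or None."""
    Pt = P
    for t in range(1, t_max + 1):
        if worst_case_tv(Pt, pi) <= threshold:
            return t
        Pt = mat_mul(Pt, P)
    return None

# Two-state example: the distance from stationarity is 0.5 * 0.8**t.
P = [[0.9, 0.1], [0.1, 0.9]]
pi = [0.5, 0.5]
print(first_time_below(P, pi))
```

For the two-state example the distance is $0.4$ at $t = 1$ before any decay beyond the first step is counted as $0.5 \cdot 0.8^t$, so the threshold $1/e \approx 0.368$ is first met at $t = 2$.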



We prove Theorem 11.61 via a well-known conductance argument [211 IMl [33] , using 
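a form of the argument derived in [9]. Let $A \subseteq \Omega$ and $M \subseteq \Omega \setminus A$ satisfy $\pi(A) \le 1/2$
and $\omega_1 \in A$, $\omega_2 \in \Omega \setminus (A \cup M) \Rightarrow P(\omega_1, \omega_2) = 0$. Then from [9] we have

$$\tau_{\mathcal{M}} \ge \frac{\pi(A)}{8\pi(M)}. \qquad (8)$$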

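Let us return to the setup of Theorem 1.6. For even $n$, $\mathbb{Z}_n^d$ is bipartite with partition
classes $\mathcal{E}$ (consisting of those vertices the sum of whose coordinates is even) and $\mathcal{O}$. We
will show that most 3-colorings have an imbalance whereby the vertices colored 0 lie
either predominantly in $\mathcal{E}$ or predominantly in $\mathcal{O}$, and those that are roughly balanced
are highly unlikely in stationarity. Accordingly let us define the set of balanced
3-colorings by

$$C_3^{b,\rho} = \{\chi \in C_3 : \big| |\chi^{-1}(0) \cap \mathcal{E}| - |\chi^{-1}(0) \cap \mathcal{O}| \big| \le \rho n^d/2\}$$

and let

$$C_3^{\mathcal{E},\rho} = \{\chi \in C_3 : |\chi^{-1}(0) \cap \mathcal{E}| - |\chi^{-1}(0) \cap \mathcal{O}| > \rho n^d/2\}.$$

By symmetry, $\pi_3(C_3^{\mathcal{E},\rho}) \le 1/2$ (recall that $\pi_3$ is uniform distribution). Notice that
since $\mathcal{M}$ updates at most $\rho n^d$ vertices in each step, we have that if $\chi_1 \in C_3^{\mathcal{E},\rho}$ and
$\chi_2 \in C_3 \setminus (C_3^{\mathcal{E},\rho} \cup C_3^{b,\rho})$ then $P_{\mathcal{M}}(\chi_1, \chi_2) = 0$. Therefore, by (8),

$$\tau_{\mathcal{M}} \ge \frac{\pi_3(C_3^{\mathcal{E},\rho})}{8\pi_3(C_3^{b,\rho})} = \frac{1 - \pi_3(C_3^{b,\rho})}{16\pi_3(C_3^{b,\rho})}$$

and so Theorem 1.6 follows from the following critical theorem.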

Theorem 3.1 Fix $\rho > 0$ satisfying $H(\rho) + \rho < 1$. There is a constant $d_0 = d_0(\rho) > 0$
for which the following holds. For $d \ge d_0$ and $n \ge 4$ even,

$$\pi_3(C_3^{b,\rho}) \le \exp\left\{ -\frac{2n^{d-1}}{d^4 \log^2 n} \right\}.$$
3.1 Cutsets revisited



One difficulty we have to overcome in moving from a Gibbs measure argument to a
torpid mixing argument is that of going from bounding the probability of a configuration
having a single cutset to bounding the probability of it having an ensemble of
cutsets. Another difficulty is that the cutsets we consider in these ensembles can be
topologically more complex than the connected cutsets that are considered in the phase
transition result. In part, both of these difficulties are dealt with by the machinery
developed in [13].

We begin by describing a way of associating with each $\chi \in C_3^{b,\rho}$ a collection of
minimal edge cutsets, extending the process described in Section 2.4.

For $\chi \in C_3^{b,\rho}$ set $I = I(\chi) = \chi^{-1}(0)$. Given a component $R$ of $(I^{\mathcal{E}})^+$ or $(I^{\mathcal{O}})^+$ and
a component $C$ of $\overline{R}$, set $\gamma = \gamma(R, C, \chi) = \nabla(C)$ and $W = W(R, C, \chi) = \overline{C}$. As in
Section 2.4, $\gamma$ is a minimal edge-cutset in $\mathbb{Z}_n^d$. Define $\mathrm{int}\,\gamma$, the interior of $\gamma$, to be the
smaller of $C$, $W$ (if $|W| = |C|$, take $\mathrm{int}\,\gamma = W$).

The collection of cutsets associated to $\chi$ depends only on the independent set $I$,
and coincides exactly with the cutsets associated to an independent set in [13]. We
may therefore apply the machinery developed in [13] for independent set cutsets in the
present setting. In particular, from [13, Lemmas 3.1 and 3.2] we know that for each
$\chi \in C_3$ there is a subset $\Gamma(\chi)$ of the collection of cutsets associated to $\chi$ that either
satisfies

for all $\gamma \in \Gamma(\chi)$, $\mathrm{int}\,\gamma = W$; for all $\gamma, \gamma' \in \Gamma(\chi)$ with $\gamma \neq \gamma'$, $\mathrm{int}\,\gamma \cap \mathrm{int}\,\gamma' = \emptyset$;
and for all $\gamma \in \Gamma(\chi)$, $R$ is a component of $(I^{\mathcal{E}})^+$ and $I^{\mathcal{E}} \subseteq \cup_{\gamma \in \Gamma(\chi)} \mathrm{int}\,\gamma$     (9)

or the analogue of (9) with $\mathcal{E}$ replaced by $\mathcal{O}$. Set $C_3^{even} = \{\chi \in C_3 : \chi \text{ satisfies (9)}\}$.
From here on whenever $\chi \in C_3^{even}$ is given we assume that $I$ is its associated independent
set and that $\Gamma(\chi)$ is a particular collection of cutsets associated with $\chi$ and satisfying
(9).

The cutsets that we have constructed here have many properties in common with
those constructed in Section 2.4; in particular, each $\gamma \in \Gamma(\chi)$ satisfies (3), (4), (5), (6)
and (7). The proof of (7) appeals to [2, Theorem 8] instead of [2, Theorem 3] and uses
the fact that for large enough $n$ and for $|B| = d^{O(1)}$ we may apply [14, Lemma 2.13] in
the setting of the torus without modification.

The cutsets in $\Gamma(\chi)$ also satisfy a connectivity property, although because the torus
is topologically more complex than $\mathbb{Z}^d$ the connectivity property is more involved. In
[13, Lemma 3.4] it is shown that each $\gamma \in \Gamma(\chi)$ is either connected in the dual of the
torus (the graph on the edges of the torus in which two edges are adjacent if there is
a 4-cycle including both of them) or has at least $n^{d-1}$ edges in each component. As in
the case of phase transition, this property is important in the proof of Lemma 4.3, but
since we take this lemma directly from [13] we do not give further details here.
3.2 The main lemma for torpid mixing

For $c \in \mathbb{N}$ and $v \in V$ set

$$\mathcal{W}(c, v) = \{\gamma : |\gamma| = c, \ \gamma \in \Gamma(\chi) \text{ for some } \chi \in C_3^{even}, \text{ and } v \in (\mathrm{int}\,\gamma)^{\mathcal{E}}\}$$

and set $\mathcal{W} = \cup_{c,v} \mathcal{W}(c, v)$. A profile of a collection $\{\gamma_0, \ldots, \gamma_\ell\} \subseteq \mathcal{W}$ is a vector
$p = (c_0, v_0, \ldots, c_\ell, v_\ell)$ with $\gamma_i \in \mathcal{W}(c_i, v_i)$ for all $i$. Given a profile $p$ set

$$C_3(p) = \{\chi \in C_3^{even} : \Gamma(\chi) \text{ contains a subset with profile } p\}.$$

Our main lemma (c.f. [13, Lemma 3.5]) is the following.

Lemma 3.2 There are constants $C, d_0 > 0$ such that the following holds. For all even
$n \ge 4$ and $d \ge d_0$, and all profiles $p$ as above,
T^-iiCsip)) < exp 

3.3 Proof of Theorem 3.1

We will prove Lemma 3.2 in Section 4. Here, we derive Theorem 3.1 from it. Throughout
we assume that the conditions of Theorem 3.1 and Lemma 3.2 are satisfied (with
$d_0$ sufficiently large to support our assertions).

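We begin with an easy count that dispenses with colorings where $|I(\chi)|$ is small.

Set

$$\mathcal{C}_3^{\rm small} = \Big\{\chi \in \mathcal{C}_3^{b,\rho} : \min\{|I^{\mathcal{E}}|, |I^{\mathcal{O}}|\} \le \frac{n^d}{4d^{1/2}}\Big\}.$$

Lemma 3.3 $\pi_3(\mathcal{C}_3^{\rm small}) \le \exp\{-\Omega(n^d)\}$.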

Proof. For any $A \subseteq \mathcal{E}$ and $B \subseteq \mathcal{O}$, let ${\rm comp}(A, B)$ be the number of components in
$V \setminus (A \cup B \cup \partial^* A \cup \partial^* B)$, where for $T \subseteq \mathcal{E}$ (or $\mathcal{O}$),

$$\partial^* T = \{x \in \partial_{\rm ext} T : \partial x \subseteq T\} \ (= \{x \in V : \partial x \subseteq T\}).$$

We begin by noting that by $\mathcal{E}$-$\mathcal{O}$ symmetry

$$|\mathcal{C}_3^{\rm small}| \le 2 \sum \exp_2\{|\partial^* A| + |\partial^* B| + {\rm comp}(A, B)\}, \tag{10}$$

where the sum is over all pairs $A \subseteq \mathcal{E}$, $B \subseteq \mathcal{O}$ with no edges between $A$ and $B$ and
satisfying $|A| \le n^d/4d^{1/2}$ and $|B| \le (\rho + 1/2d^{1/2})n^d/2$. Indeed, once we have specified
that the set of vertices colored 0 is $A \cup B$, we have a free choice between 1 and 2 for
the color at each $x \in \partial^* A \cup \partial^* B$, and we also have a free choice between the two possible
colorings of each component of $V \setminus (A \cup B \cup \partial^* A \cup \partial^* B)$.

A key observation is the following. For $A$ and $B$ contributing to the sum in (10),

$${\rm comp}(A, B) \le \frac{n^d}{2d}. \tag{11}$$

To see this, let $C$ be a component of $V \setminus (A \cup B)$. If $C = \{v\}$ consists of a single
vertex, then (depending on the parity of $v$) we have either $\partial v \subseteq A$ or $\partial v \subseteq B$ and so
$v \in \partial^* A \cup \partial^* B$. Otherwise, let $vw$ be an edge of $C$ with $v \in \mathcal{E}$ (and so $w \in \mathcal{O}$). If $v$
has $k$ edges to $B$ and $w$ has $\ell$ to $A$, then (since there are no edges from $A$ to $B$) we
have $(k - 1) + (\ell - 1) \le 2d - 2$ or $k + \ell \le 2d$. (Here we are using that in $\mathbb{Z}_n^d$, if $uv \in E$
then there is a matching between all but one of the neighbors of $u$ and $v$.) Since $v$ has
$2d - 1 - k$ edges to $\mathcal{O} \setminus (B \cup \{w\})$ and $w$ has $2d - 1 - \ell$ edges to $\mathcal{E} \setminus (A \cup \{v\})$ we have
that $|C| \ge 4d - (k + \ell) \ge 2d$. From this (11) follows.
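
As an informal sanity check of the component-size argument behind (11) (it plays no role in the proof), the following Python sketch samples random pairs $A \subseteq \mathcal{E}$, $B \subseteq \mathcal{O}$ with no edges between them on a small torus and records the smallest component of $V \setminus (A \cup B)$ that has more than one vertex. The function names, the sampling scheme and the choice $n = 4$, $d = 2$ are illustrative assumptions of ours, not taken from the paper.

```python
import random
from itertools import product


def torus_neighbors(v, n):
    """Return the 2d neighbors of vertex v (a tuple) in the discrete torus Z_n^d."""
    out = []
    for k in range(len(v)):
        for step in (1, -1):
            w = list(v)
            w[k] = (w[k] + step) % n
            out.append(tuple(w))
    return out


def component_sizes(vertices, n):
    """Sizes of the connected components of the subgraph induced on `vertices`."""
    remaining = set(vertices)
    sizes = []
    while remaining:
        stack = [remaining.pop()]
        size = 1
        while stack:
            v = stack.pop()
            for w in torus_neighbors(v, n):
                if w in remaining:
                    remaining.remove(w)
                    stack.append(w)
                    size += 1
        sizes.append(size)
    return sizes


def smallest_nontrivial_component(n, d, trials=300, seed=0):
    """Sample pairs (A, B) with no A-B edges and return the smallest component
    of V minus (A union B) with more than one vertex (None if none was seen)."""
    rng = random.Random(seed)
    V = list(product(range(n), repeat=d))
    even = [v for v in V if sum(v) % 2 == 0]
    odd = [v for v in V if sum(v) % 2 == 1]
    best = None
    for _ in range(trials):
        A = {v for v in even if rng.random() < 0.5}
        # Keep only odd vertices with no neighbor in A, so no edge joins A and B.
        B = {v for v in odd
             if rng.random() < 0.5
             and not any(w in A for w in torus_neighbors(v, n))}
        sizes = [s for s in component_sizes(set(V) - A - B, n) if s > 1]
        if sizes and (best is None or min(sizes) < best):
            best = min(sizes)
    return best
```

Calling `smallest_nontrivial_component(4, 2)` should never return a value below $2d = 4$, in line with the bound $|C| \ge 2d$ derived above.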

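Inserting (11) into (10) and bounding $|\partial^* A|$ and $|\partial^* B|$ by the maximum values of
$|A|$ and $|B|$ (valid since $T \subseteq \mathcal{E}$ (or $\mathcal{O}$) satisfies $|T| \le |\partial_{\rm ext} T|$, so $|\partial^* T| \le |T|$) and with
the remaining inequalities justified below, we have

$$|\mathcal{C}_3^{\rm small}| \le 2\exp_2\Big\{\frac{n^d}{2}\Big(\rho + \frac{1}{\sqrt d} + \frac{1}{d} + H\Big(\frac{1}{2\sqrt d}\Big) + H\Big(\rho + \frac{1}{2\sqrt d}\Big)\Big)\Big\} \tag{12}$$

$$\le \exp_2\Big\{\frac{n^d}{2}\big(1 - \Omega(1)\big)\Big\} \tag{13}$$

for sufficiently large $d = d(\rho)$. In (12) we use the bound $\sum_{i \le \beta M} \binom{M}{i} \le 2^{H(\beta)M}$ for $\beta \le 1/2$;
in (13) we use $H(\rho) + \rho < 1$. Using $2^{n^d/2} \le |\mathcal{C}_3|$, the lemma follows. □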

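We now consider

$$\mathcal{C}_3^{\rm large, even} = (\mathcal{C}_3^{b,\rho} \setminus \mathcal{C}_3^{\rm small}) \cap \mathcal{C}_3^{\rm even}.$$

By Lemma 3.3 and $\mathcal{E}$-$\mathcal{O}$ symmetry, Theorem 3.1 reduces to bounding (say)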

vr3(Cr^^'-^'^)<exp(--|f^|. (14) 

[ # log n ) 

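Let $\mathcal{C}_3^{\rm large, even, nt}$ be the set of $\chi \in \mathcal{C}_3^{\rm large, even}$ such that there is a $\gamma \in \Gamma(\chi)$ with
$|\gamma| \ge n^{d-1}$ and let $\mathcal{C}_3^{\rm large, even, triv} = \mathcal{C}_3^{\rm large, even} \setminus \mathcal{C}_3^{\rm large, even, nt}$. We assert that

$$\pi_3(\mathcal{C}_3^{\rm large, even, nt}) \le \exp\Big\{-\Omega\Big(\frac{n^{d-1}}{d}\Big)\Big\} \tag{15}$$

and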

//7laree, even, triv\ ^ I I /-i 

7r3(C3 ) < exp <^ - ^ \ ; (16) 

# log n

this gives (14) and so completes the proof of Theorem 3.1. Both (15) and (16) are
corollaries of Lemma 3.2, and the steps are identical to those that are used to bound
the measures of $\mathcal{I}^{\rm large, even}_{\rm nt}$ and $\mathcal{I}^{\rm large, even}_{\rm triv}$ in [13, Section 3.3]. We now give the details.

With the sum below running over all profiles $p$ of the form $(c, v)$ with $v \in V$ and
$c \ge n^{d-1}$, and with the inequalities justified below, we have

$$\pi_3(\mathcal{C}_3^{\rm large, even, nt}) \le \sum_p \pi_3(\mathcal{C}_3(p)) \le n^{2d} \exp\Big\{-\Omega\Big(\frac{n^{d-1}}{d}\Big)\Big\} \tag{17}$$

$$\le \exp\Big\{-\Omega\Big(\frac{n^{d-1}}{d}\Big)\Big\},$$

giving (15). We use Lemma 3.2 in (17). The factor of $n^{2d}$ is for the choices of $c$ and $v$.

The verification of (16) involves finding an $i \in [\Omega(\log d), O(d \log n)]$ and a set
$\Gamma_i(\chi) \subseteq \Gamma(\chi)$ of cutsets with the properties that $|\Gamma_i(\chi)| \lesssim n^d/2^i$, $|\gamma| \approx 2^i$ for each
$\gamma \in \Gamma_i(\chi)$ and $\sum_{\gamma \in \Gamma_i(\chi)} |\gamma| \gtrsim n^{d-1}$. The measure of $\mathcal{C}_3^{\rm large, even, triv}$ is then at most
the product of a term that is exponentially small in $n^{d-1}$ (from Lemma 3.2), a term
corresponding to the choice of a fixed vertex in each of the interiors, and a term
corresponding to the choice of the collection of cutset sizes. The second term will be
negligible because $\Gamma_i(\chi)$ is small and the third will be negligible because all $\gamma \in \Gamma_i(\chi)$
have similar sizes.

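More precisely, for $\chi \in \mathcal{C}_3^{\rm large, even, triv}$ and $\gamma \in \Gamma(\chi)$ we have $|\gamma| \ge |\mathrm{int}\,\gamma|^{1 - 1/d}$ (by
(7)) and so

$$\sum_{\gamma \in \Gamma(\chi)} |\gamma|^{d/(d-1)} \ge \sum_{\gamma \in \Gamma(\chi)} |\mathrm{int}\,\gamma| \ge |I^{\mathcal{E}}| \ge \frac{n^d}{4d^{1/2}}.$$

The second inequality is from (9) and the third follows since $\chi \notin \mathcal{C}_3^{\rm small}$.

Set $\Gamma_i(\chi) = \{\gamma \in \Gamma(\chi) : 2^{i-1} \le |\gamma| < 2^i\}$. Note that $\Gamma_i(\chi)$ is empty for $2^i < d^2$
(again by (7)) and for $2^{i-1} > n^{d-1}$ so we may assume that

$$2 \log d \le i \le (d - 1) \log n + 1. \tag{18}$$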

Since $\sum_{m=1}^{\infty} 1/m^2 = \pi^2/6$, there is an $i$ such that

$$\sum_{\gamma \in \Gamma_i(\chi)} |\gamma|^{d/(d-1)} \ge \Omega\Big(\frac{n^d}{i^2 \sqrt d}\Big). \tag{19}$$

Choose the smallest such $i$ and set $\ell = |\Gamma_i(\chi)|$. We have $\sum_{\gamma \in \Gamma_i(\chi)} |\gamma| \ge \Omega(\ell 2^i)$ (this
follows from the fact that each $\gamma \in \Gamma_i(\chi)$ satisfies $|\gamma| \ge 2^{i-1}$) and
The first inequality follows from the fact that $|\gamma| \le dn^d = |E|$; the second follows
from (19) and the fact that each $\gamma$ has $|\gamma|^{d/(d-1)} < 2^{id/(d-1)}$. We therefore have $\chi \in \mathcal{C}_3(p)$
for some $p = (c_1, v_1, \ldots, c_\ell, v_\ell)$ with $\ell$ satisfying (20), with

$$\sum_{j=1}^{\ell} c_j \ge \Omega(\ell 2^i), \tag{21}$$

with

$$c_j \le 2^i \tag{22}$$

for each $j$ and with $i$ satisfying (18). With the sum below running over all $p$ satisfying
(20), (21) and (22) we have

$$\pi_3(\mathcal{C}_3^{\rm large, even, triv}) \le \sum_p \pi_3(\mathcal{C}_3(p)). \tag{23}$$

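The right-hand side of (23) is, by Lemma 3.2, at most

$$d \log n \, \max\Big\{2^{i\ell} \binom{n^d}{\ell} \exp\Big\{-\Omega\Big(\frac{\ell 2^i}{d}\Big)\Big\} : i \text{ satisfying (18)}\Big\}.$$

The factor of $d \log n$ is an upper bound on the number of choices for $i$; the factor of $2^{i\ell}$
is for the choice of the $c_j$'s; and the factor $\binom{n^d}{\ell}$ is for the choice of the $\ell$ (distinct) $v_j$'s.
By (18) and the second inequality in (20) we have (for $d$ sufficiently large)

$$2^{i\ell} \binom{n^d}{\ell} \le 2^{i\ell} \Big(\frac{en^d}{\ell}\Big)^{\ell} \le 2^{O(i\ell)} = \exp\Big\{o\Big(\frac{\ell 2^i}{d}\Big)\Big\},$$

so that in fact the right-hand side of (23) is at most

$$d \log n \, \max \exp\Big\{-\Omega\Big(\frac{\ell 2^i}{d}\Big)\Big\}.$$
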

Taking $\ell$ as small as possible we see that this is at most



d log n max exp < — ^7 



and taking $i$ as large as possible we see that it is at most $\exp\{-4n^{d-1}/d^4 \log^2 n\}$.
Putting these observations together we obtain (16).



4 Proof of Lemmas 2.3 and 3.2 



In this section we complete the proofs of Theorems 1.1 and 1.4 by establishing the
two technical statements concerning cutsets from Sections 2 and 3. Much of what
follows is modified from [13] and [14]. Because the cutsets described in Sections 2.4
and 3.1 are quite similar, the two proofs proceed almost identically, and we give them
in parallel. Before beginning this process we reduce Lemma 3.2 to (24) below. Let
$p = (c_0, v_0, \ldots, c_\ell, v_\ell)$ be given. Set $p' = (c_1, v_1, \ldots, c_\ell, v_\ell)$. We will show



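$$\frac{|\mathcal{C}_3(p)|}{|\mathcal{C}_3(p')|} \le \exp\Big\{-\Omega\Big(\frac{c_0}{d}\Big)\Big\} \tag{24}$$

from which Lemma 3.2 follows by a telescoping product. To obtain (24) we define a
one-to-many map $\varphi$ from $\mathcal{C}_3(p)$ to $\mathcal{C}_3(p')$. We then define a flow $\nu : \mathcal{C}_3(p) \times \mathcal{C}_3(p') \to [0, \infty)$
supported on pairs $(\chi, \chi')$ with $\chi' \in \varphi(\chi)$ satisfying

$$\forall \chi \in \mathcal{C}_3(p), \quad \sum_{\chi' \in \varphi(\chi)} \nu(\chi, \chi') = 1 \tag{25}$$

and

$$\forall \chi' \in \mathcal{C}_3(p'), \quad \sum_{\chi \in \varphi^{-1}(\chi')} \nu(\chi, \chi') \le \exp\Big\{-\Omega\Big(\frac{c_0}{d}\Big)\Big\}. \tag{26}$$

This easily gives (24). To obtain Lemma 2.3 we prove a variant of (24), with $\mathcal{C}_3(p')$
and $\mathcal{C}_3(p)$ replaced by the corresponding sets of colorings in the setting of Section 2
(the latter indexed by $(c_0, v_0)$).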
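
For the reader's convenience, here is a sketch of the two routine steps just mentioned: how (25) and (26) yield (24), and how (24) telescopes to Lemma 3.2. The intermediate profiles $p^{(j)}$ and the convention that the empty profile gives $\mathcal{C}_3^{\rm even}$ are our notation rather than the paper's, and we assume that every $\mathcal{C}_3(p^{(j)})$ is nonempty so that the ratios make sense.

```latex
% Double counting the flow: (25) and (26) give (24).
|\mathcal{C}_3(p)|
  = \sum_{\chi \in \mathcal{C}_3(p)} \sum_{\chi' \in \varphi(\chi)} \nu(\chi,\chi')
  = \sum_{\chi' \in \mathcal{C}_3(p')} \sum_{\chi \in \varphi^{-1}(\chi')} \nu(\chi,\chi')
  \le |\mathcal{C}_3(p')| \exp\{-\Omega(c_0/d)\}.

% Telescoping: p^{(j)} = (c_j, v_j, \ldots, c_\ell, v_\ell), so p^{(0)} = p,
% and p^{(\ell+1)} is the empty profile, with \mathcal{C}_3(p^{(\ell+1)}) \subseteq \mathcal{C}_3.
\pi_3(\mathcal{C}_3(p))
  \le \frac{|\mathcal{C}_3(p^{(0)})|}{|\mathcal{C}_3(p^{(\ell+1)})|}
  = \prod_{j=0}^{\ell} \frac{|\mathcal{C}_3(p^{(j)})|}{|\mathcal{C}_3(p^{(j+1)})|}
  \le \exp\Big\{-\Omega\Big(\frac{1}{d}\sum_{j=0}^{\ell} c_j\Big)\Big\}.
```

The last step applies (24) once to each profile $p^{(j)}$, with the same implied constant every time.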

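In what follows, we write $\mathcal{D}$ for both $\mathcal{C}_3(p')$ and its counterpart in the setting of
Lemma 2.3, and $\mathcal{C}$ for both $\mathcal{C}_3(p)$ and its counterpart, and we use $V$ both for the
vertex set of the torus and that of $\Lambda$.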

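For each $s \in \{\pm 1, \ldots, \pm d\}$, define $\sigma_s$, the shift in direction $s$, by $\sigma_s(x) = x + e_s$,
where $e_s$ is the $s$th standard basis vector if $s > 0$ and $e_s = -e_{-s}$ if $s < 0$. For $X \subseteq V$
write $\sigma_s(X)$ for $\{\sigma_s(x) : x \in X\}$. For $\gamma \in \mathcal{W}$ set $W^s = \{x \in \partial_{\rm int} W : \sigma_{-s}(x) \notin W\}$.

Let $\chi \in \mathcal{C}$ be given. For Lemma 3.2, arbitrarily pick $\gamma \in \Gamma(\chi) \cap \mathcal{W}(c_0, v_0)$ and
set $W = \mathrm{int}\,\gamma$. For Lemma 2.3, simply take $\gamma = \gamma(\chi)$ and $W = W(\gamma)$. Write $f$ for
the map from $\{0, 1, 2\}$ to $\{0, 1, 2\}$ that sends 0 to 0 and transposes 1 and 2. For each
$s \in \{\pm 1, \ldots, \pm d\}$ and $S \subseteq W^s$ define the function $\chi^s_S : V \to \{0, 1, 2\}$ by

$$\chi^s_S(v) = \begin{cases} 0 & \text{if } v \in S \\ \chi(v) & \text{if } v \in (W^s \setminus S) \cup (V \setminus W) \\ f(\chi(\sigma_{-s}(v))) & \text{if } v \in W \setminus W^s \end{cases}$$

and set $\varphi_s(\chi) = \{\chi^s_S : S \subseteq W^s\}$.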
Claim 4.1 $\varphi_s(\chi) \subseteq \mathcal{D}$.

Proof. We begin with the observation that the graph $\partial_{\rm int} W \cup \partial_{\rm ext} W$ is bipartite with
bipartition $(\partial_{\rm int} W, \partial_{\rm ext} W)$. This follows from the cutset properties above, which also
give $I \cap (\partial_{\rm int} W \cup \partial_{\rm ext} W) = \emptyset$, and so for each component $U$ of $\partial_{\rm int} W \cup \partial_{\rm ext} W$, $\chi$ is
identically 1 on one of $U \cap \partial_{\rm int} W$, $U \cap \partial_{\rm ext} W$ and identically 2 on the other.

Our main task is to show that $\varphi_s(\chi) \subseteq \mathcal{C}_3$; that is, that for any $S \subseteq W^s$ and edge
$uv$, $\chi^s_S(u) \ne \chi^s_S(v)$. We consider several cases.

If $u, v \notin W$ then $\chi^s_S(u) = \chi(u)$ and $\chi^s_S(v) = \chi(v)$. But $\chi(u) \ne \chi(v)$, so $\chi^s_S(u) \ne \chi^s_S(v)$
in this case.

If $u \in W$ and $v \notin W$ then $\chi^s_S(v) = \chi(v)$ and $\chi^s_S(u) \in \{0, \chi(u)\}$ (we will justify this
in a moment). Since $v \in \partial_{\rm ext} W$ we have $\chi(v) \ne 0$ and we cannot ever have $\chi(u) = \chi(v)$,
so $\chi^s_S(u) \ne \chi^s_S(v)$ in this case. To see that $\chi^s_S(u) \in \{0, \chi(u)\}$, we consider subcases.
If $u \in S$ then $\chi^s_S(u) = 0$. If $u \in W^s \setminus S$ then $\chi^s_S(u) = \chi(u)$. Finally, if $u \in W \setminus W^s$
then $\chi^s_S(u) = f(\chi(\sigma_{-s}(u)))$; and $f(\chi(\sigma_{-s}(u)))$ is either 0 or $\chi(u)$ depending on whether
$\chi(\sigma_{-s}(u))$ equals 0 or $\chi(v)$ ($\chi(\sigma_{-s}(u))$ cannot equal $\chi(u)$).

If $u, v \in W \setminus W^s$ then $\chi^s_S(u) = f(\chi(\sigma_{-s}(u)))$ and $\chi^s_S(v) = f(\chi(\sigma_{-s}(v)))$. Since $f$ is
a bijection and $\chi(\sigma_{-s}(u)) \ne \chi(\sigma_{-s}(v))$ we have $\chi^s_S(u) \ne \chi^s_S(v)$ in this case.

If $u \in W \setminus W^s$ and $v \in W^s \setminus S$ then $\chi^s_S(u) \in \{0, \chi(u)\}$ (as in the second case above)
and $\chi^s_S(v) = \chi(v)$. Since $\chi(v) \ne 0$, we have $\chi^s_S(u) \ne \chi^s_S(v)$.

Noting that it is not possible to have both $u, v \in W^s$, we finally treat the case
where $u \in W \setminus W^s$ and $v \in S$. In this case $\chi^s_S(v) = 0$. Suppose (for a
contradiction) that $\chi^s_S(u) = 0$. This can only happen if $\chi(\sigma_{-s}(u)) = 0$. If $\sigma_{-s}(u) = v$,
we have a contradiction immediately. Otherwise, we have $\sigma_{-s}(v) \notin W$ and so (since
$\sigma_{-s}(u)\sigma_{-s}(v) \in E$) $\sigma_{-s}(u) \in \partial_{\rm int} W$, also a contradiction.

This verifies $\varphi_s(\chi) \subseteq \mathcal{C}_3$. We now verify that $\varphi_s(\chi) \subseteq \mathcal{D}$. In the setting of Lemma
3.2, this is true because $W$ is disjoint from the interiors of the remaining cutsets in $\Gamma(\chi)$
and the operation that creates the elements of $\varphi_s(\chi)$ only modifies $\chi$ inside $W$. In the
setting of Lemma 2.3, it follows from the fact that $W \cap \partial_{\rm int} \Lambda = \emptyset$. □

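Claim 4.2 Given $\chi' \in \varphi_s(\chi)$, $\chi$ can be uniquely reconstructed from $W$ and $s$.

Proof. We may reconstruct $\chi$ via

$$\chi(v) = \begin{cases} \chi'(v) & \text{if } v \in V \setminus W \\ f(\chi'(\sigma_s(v))) & \text{if } v \in W. \end{cases}$$

□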
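
To make the shift map and its inverse concrete, here is a small Python sketch on the cycle $\mathbb{Z}_8$ (that is, $d = 1$, far from the large-$d$ regime of the paper). It builds every $\chi^s_S$ for a hand-picked set $W$ and then recovers $\chi$ with the formula of Claim 4.2. The function names and the toy coloring are our own assumptions. The set $W$ is chosen so that no vertex of $\partial_{\rm int} W \cup \partial_{\rm ext} W$ has color 0, which is the only cutset property this toy example relies on.

```python
from itertools import combinations

F = {0: 0, 1: 2, 2: 1}  # the map f: fixes 0 and transposes 1 and 2


def neighbors(v, n):
    """The 2d neighbors of v (a tuple) in the discrete torus Z_n^d."""
    out = []
    for k in range(len(v)):
        for step in (1, -1):
            w = list(v)
            w[k] = (w[k] + step) % n
            out.append(tuple(w))
    return out


def shift(v, s, n):
    """sigma_s(v) = v + e_s, where e_s = -e_{-s} for s < 0 (s ranges over +-1..+-d)."""
    w = list(v)
    k = abs(s) - 1
    w[k] = (w[k] + (1 if s > 0 else -1)) % n
    return tuple(w)


def is_proper(chi, n):
    """True if no edge of the torus is monochromatic under chi."""
    return all(chi[v] != chi[w] for v in chi for w in neighbors(v, n))


def shifted_colorings(chi, W, s, n):
    """Return the list of all chi^s_S, with S ranging over subsets of W^s."""
    # W^s: vertices of W whose predecessor in direction s lies outside W
    # (such vertices are automatically on the internal boundary of W).
    Ws = [v for v in W if shift(v, -s, n) not in W]
    out = []
    for r in range(len(Ws) + 1):
        for S in combinations(Ws, r):
            new = dict(chi)  # vertices outside W keep their color
            for v in W:
                if v in S:
                    new[v] = 0
                elif v in Ws:
                    new[v] = chi[v]
                else:
                    new[v] = F[chi[shift(v, -s, n)]]
            out.append(new)
    return out


def reconstruct(chi_prime, W, s, n):
    """Recover chi from chi' using the formula of Claim 4.2."""
    return {v: F[chi_prime[shift(v, s, n)]] if v in W else chi_prime[v]
            for v in chi_prime}


# Toy example: a proper 3-coloring of the 8-cycle and an interval W.
n = 8
chi = {(i,): c for i, c in enumerate([1, 2, 0, 2, 1, 2, 1, 2])}
W = {(1,), (2,), (3,)}
images = shifted_colorings(chi, W, 1, n)
```

Here `images` contains $2^{|W^s|} = 2$ colorings, each proper, and `reconstruct(c, W, 1, n)` returns `chi` for each of them. The reconstruction never reads the colors on $W^s$, which is why the free choice of $S$ loses no information.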

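We define the one-to-many map $\varphi$ from $\mathcal{C}$ to $\mathcal{D}$ by setting $\varphi(\chi) = \varphi_s(\chi)$ for a
particular direction $s$. To define $\nu$ and $s$, we employ the notion of approximation also
used in [14] and based on ideas introduced by Sapozhenko in [31]. For $\gamma \in \mathcal{W}$, we say
$A \subseteq V$ is an approximation of $\gamma$ if

$$A^{\mathcal{E}} \supseteq W^{\mathcal{E}} \quad \text{and} \quad A^{\mathcal{O}} \subseteq W^{\mathcal{O}},$$

$$d_{A^{\mathcal{O}}}(x) \ge 2d - \sqrt d \quad \text{for all } x \in A^{\mathcal{E}}$$

and

$$d_{\mathcal{E} \setminus A^{\mathcal{E}}}(y) \ge 2d - \sqrt d \quad \text{for all } y \in \mathcal{O} \setminus A^{\mathcal{O}},$$

where $d_X(x) = |\partial x \cap X|$. Note that by (5), $W(\gamma)$ is an approximation of $\gamma$.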

Before stating our main approximation lemma, it will be convenient to further refine
our partition of cutsets. To this end set

$$\mathcal{W}(w_e, w_o, v_0) = \{\gamma : \gamma \in \mathcal{W} \text{ with } |W^{\mathcal{O}}| = w_o,\ |W^{\mathcal{E}}| = w_e \text{ and } v_0 \in W^{\mathcal{E}}\}.$$

Note that we have $|\gamma| = 2d(|W^{\mathcal{O}}| - |W^{\mathcal{E}}|)$, so $\mathcal{W}(w_e, w_o, v_0) \subseteq \mathcal{W}(2d(w_o - w_e), v_0)$.

Lemma 4.3 For each $w_e$, $w_o$ and $v_0$ there is a family $\mathcal{A}(w_e, w_o, v_0)$ of subsets of $V$
satisfying

$$|\mathcal{A}(w_e, w_o, v_0)| \le \exp\big\{O\big((w_o - w_e) d^{-1/2} \log^{3/2} d\big)\big\}$$

and a map $\pi : \mathcal{W}(w_e, w_o, v_0) \to \mathcal{A}(w_e, w_o, v_0)$ such that for each $\gamma \in \mathcal{W}(w_e, w_o, v_0)$,
$\pi(\gamma)$ is an approximation of $\gamma$.

Proof. In the setting of Lemma 2.3 this is exactly [14, Lemma 2.18]; in the setting of
Lemma 3.2 it is [13, Lemma 4.2]. □

In both settings, the proof proceeds along the same lines. We begin by associating 
with each cutset a small set of vertices (much smaller than the size of the cutset) which 
weakly approximates the cutset in the sense that the neighborhood of the associated set 
separates the interior of the cutset from the exterior. This part of the proof combines 
algorithmic and probabilistic elements, and relies heavily on the structure of the lattice. 
The total number of weak approximations that can arise as we run over all cutsets 
of a given size is controlled in part by the fact that these weak approximations are 
connected (in a suitable sense); this property is inherited from the connectivity of 
the cutsets themselves. The second part of the proof proceeds by refining the weak 
approximations into approximations in the sense defined above. This part of the proof 
is purely algorithmic and uses no properties of the lattice other than that it is regular 
and bipartite. 

We are now in a position to define $\nu$ and $s$. Recall that we have fixed, for each
$\chi \in \varphi^{-1}(\chi')$, a particular cutset $\gamma$. Our plan is to fix $w_e$, $w_o$ and $A \in \mathcal{A}(w_e, w_o, v_0)$ and
to consider the contribution to the sum in (26) from those $\chi \in \varphi^{-1}(\chi')$ with $\pi(\gamma) = A$.
We will try to define $\nu$ in such a way that each of these individual contributions to
(26) is small; to succeed in this endeavor we must first choose $s$ with care. To this end,
given $A \in \mathcal{A}(w_e, w_o, v_0)$ set

$$Q^{\mathcal{E}} = A^{\mathcal{E}} \cap \partial_{\rm ext}(\mathcal{O} \setminus A^{\mathcal{O}}) \quad \text{and} \quad Q^{\mathcal{O}} = (\mathcal{O} \setminus A^{\mathcal{O}}) \cap \partial_{\rm ext} A^{\mathcal{E}}.$$

To motivate the introduction of $Q^{\mathcal{E}}$ and $Q^{\mathcal{O}}$, note that for $\gamma \in \pi^{-1}(A)$ we have (by
(6) and the definition of approximation)

$$A^{\mathcal{E}} \setminus Q^{\mathcal{E}} \subseteq W^{\mathcal{E}}$$

and

$$\mathcal{O} \setminus (A^{\mathcal{O}} \cup Q^{\mathcal{O}}) \subseteq \mathcal{O} \setminus W^{\mathcal{O}}.$$

It follows that for each $\gamma \in \pi^{-1}(A)$, $Q^{\mathcal{E}} \cup Q^{\mathcal{O}}$ contains all vertices whose location in
the partition $V = W \cup (V \setminus W)$ is as yet unknown.

We choose $s(\chi)$ to be the smallest $s$ for which both of $|W^s| \ge 0.8(w_o - w_e)$ and
$|\sigma_s(Q^{\mathcal{E}}) \cap Q^{\mathcal{O}}| \le 5|W^s|/\sqrt d$ hold. This is the direction that minimizes the uncertainty
to be resolved when we attempt to reconstruct $\chi$ from the partial information provided
by $\chi' \in \varphi(\chi)$, $s$ and $A$. (That such an $s$ exists is established in [14, (49) and (50)]
by an easy averaging argument.) Note that $s$ depends on $\gamma$ but not $I$.

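Now for each $\chi \in \mathcal{C}$ let $\gamma \in \Gamma(\chi)$ be a particular cutset with $\gamma \in \mathcal{W}(c_0, v_0)$. Let
$\varphi(\chi)$ be as defined before, with $s$ as specified above. Define

$$C = W^s \cap A^{\mathcal{O}} \cap \sigma_s(Q^{\mathcal{E}})$$

and

$$D = W^s \setminus C,$$

and for each $\chi' \in \varphi(\chi)$ set

$$\nu(\chi, \chi') = \Big(\frac14\Big)^{|C \cap I(\chi')|} \Big(\frac34\Big)^{|C \setminus I(\chi')|} \Big(\frac12\Big)^{|D|}.$$

Note that for $\chi \in \varphi^{-1}(\chi')$, $\nu(\chi, \chi')$ depends on $W$ but not on $\chi$ itself.

Since $C \cup D$ partitions $W^s$ we easily have (25). To obtain (24) we must establish
(26).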

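Fix $w_e$, $w_o$ such that $2d(w_o - w_e) = c_0$. Fix $A \in \mathcal{A}(w_e, w_o, v_0)$ and $s \in \{\pm 1, \ldots, \pm d\}$.
For $\chi$ with $\gamma \in \mathcal{W}(w_e, w_o, v_0)$ write $\chi \sim_s A$ if it holds that $\pi(\gamma) = A$ and $s(\chi) = s$. We
claim that with $A$, $s$, $w_o$ and $w_e$ fixed, for $\chi' \in \mathcal{D}$

$$\sum_{\chi \sim_s A,\ \chi \in \varphi^{-1}(\chi')} \nu(\chi, \chi') \le \Big(\frac{\sqrt 3}{2}\Big)^{w_o - w_e}. \tag{27}$$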

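We now describe the proof of (27). Write $\mathcal{C}(w_e, w_o, s, A, \chi')$ for the set of all $\chi \in \mathcal{C}$
such that $\gamma \in \mathcal{W}(w_e, w_o, v_0)$, $\pi(\gamma) = A$, $s(\chi) = s$ and $\chi' \in \varphi(\chi)$, and set
$U = Q^{\mathcal{E}} \cap \sigma_{-s}(\chi')$. Say that a triple $(K, L, M)$ is good for $\chi$ if it satisfies the following
conditions:

$$K \cup L \cup M \text{ is a minimal vertex cover of } Q^{\mathcal{E}} \cup Q^{\mathcal{O}},$$

$$K \subseteq Q^{\mathcal{O}}, \quad L \subseteq U \quad \text{and} \quad M \subseteq Q^{\mathcal{E}} \setminus U$$

and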

K = d,^,iU\L). 

We begin by establishing that $\chi \in \mathcal{C}(w_e, w_o, s, A, \chi')$ always has a good triple.

Lemma 4.4 For each $\chi \in \mathcal{C}(w_e, w_o, s, A, \chi')$ the triple

$$(K, L, M) := (W \cap Q^{\mathcal{O}},\ U \setminus W,\ (Q^{\mathcal{E}} \setminus U) \setminus W)$$

is good for $\chi$.

Proof. [14, around discussion of (54)]. □

In view of Lemma 4.4 there is a triple $(K, L, M)$ that is good for $\chi$ and which
has $|K| + |L|$ as small as possible. Choose one such, say $(K_0(\chi), L_0(\chi), M_0(\chi))$. Set
$K'(\chi) = K_0 \setminus K$ and $L'(\chi) = L_0 \setminus L$. Lemma 4.5 below establishes an upper bound
on $\nu(\chi, \chi')$ in terms of $|K_0|$, $|L_0|$, $|K'|$ and $|L'|$, and Lemma 4.6 shows that for each
choice of $K'$, $L'$ there is at most one $\chi$ contributing to the sum in the lemma. These
two lemmas combine to give (27).

Lemma 4.5 For each $\chi \in \mathcal{C}(w_e, w_o, s, A, \chi')$,



V3\ 2l^ol 



2 / ^\Ko\ + \Lo\2\K'HL'\ 

B{K',L'). 



Proof. We follow [14, from just before (55) to just after (60)], making superficial changes
of notation. □

The inequality in Lemma 4.5 is the 3-coloring analogue of the main inequality of
[13]. The key observation that makes this inequality useful is the following.

Lemma 4.6 For each $w_e$, $w_o$, $s$, $A$, $\chi'$, $K'$ and $L'$, there is at most one $\chi$ with
$\chi \in \mathcal{C}(w_e, w_o, s, A, \chi')$, $K' = K'(\chi)$ and $L' = L'(\chi)$.

Proof. In [14, (56) and following] it is shown that $K'$ and $L'$ determine $W^{\mathcal{O}}$ via

$$K = (K_0 \setminus K') \cup (\partial_{\rm ext} L' \cap Q^{\mathcal{O}})$$

and so $W$ (via $W^{\mathcal{E}} = \{v \in \mathcal{E} : \partial v \subseteq W^{\mathcal{O}}\}$). But then by Claim 4.2, $K'$ and $L'$ determine
$\chi$. □

Lemmas 4.5 and 4.6 together now easily give (27):

X&C{we,Wo,s,A,x') K'CKo, L'CLo 



UQ — we 



< 



2 



We have now almost reached (26). With the steps justified below we have that for
each $\chi' \in \mathcal{D}$

$$\sum_{\chi \in \varphi^{-1}(\chi')} \nu(\chi, \chi') \le \sum \Big(\frac{\sqrt 3}{2}\Big)^{c_0/2d}$$

$$\le 2d\, c_0^{2d/(d-1)}\, |\mathcal{A}(w_e, w_o, v_0)|\, \Big(\frac{\sqrt 3}{2}\Big)^{c_0/2d} \tag{28}$$

$$\le 2d\, c_0^{2d/(d-1)} \exp\{-\Omega(c_0/d)\} \tag{29}$$

$$\le \exp\{-\Omega(c_0/d)\}, \tag{30}$$

completing the proof of (26). In the first inequality, the sum on the right-hand side
is over all choices of $w_e$, $w_o$, $s$ and $A$. In (28), we note that there are $|\mathcal{A}(w_e, w_o, v_0)|$
choices for $A$, $2d$ choices for $s$ and $c_0^{d/(d-1)}$ choices for each of $w_e$, $w_o$ (this is because
$c_0 \ge (w_e + w_o)^{1 - 1/d}$, by (7)), and we apply (27) to bound the summand. In (29) we
use Lemma 4.3. Finally in (30) we use $c_0 \ge d^2$ (again by (7)) to bound $2d\, c_0^{2d/(d-1)} =
\exp\{o(c_0/d)\}$.



5 Proof of Theorem 1.2 (measures of maximal entropy)

Here we establish that the Gibbs measure studied in Theorem 1.1 is a measure of
maximal entropy. Recall that for a probability distribution $X$ with finite range that
takes on value $x$ with probability $p(x)$, the entropy of $X$ is

$$H(X) = -\sum_x p(x) \log p(x).$$

We have $H(X) \le \log|{\rm range}(X)|$ with equality if and only if $X$ is uniform.
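
As a quick numerical illustration of these facts, the following Python sketch computes $H(X)$ for two small distributions. It also checks the bound $H(X) \ge -\log \max_x p(x)$, which is used later in this section. The function name and the example distributions are our own choices.

```python
import math


def entropy(p):
    """Shannon entropy (natural logarithm) of a probability vector p.

    Terms with p(x) = 0 contribute nothing, following the usual convention.
    """
    return -sum(q * math.log(q) for q in p if q > 0)


uniform = [0.25, 0.25, 0.25, 0.25]   # uniform on a range of size 4
skewed = [0.5, 0.25, 0.25]           # non-uniform on a range of size 3

h_uniform = entropy(uniform)
h_skewed = entropy(skewed)
```

For the uniform distribution the entropy equals $\log 4$, while for the skewed one it lies strictly between $-\log \max_x p(x) = \log 2$ and $\log 3$.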

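Let $\Lambda_n$ be the box $\{-n, \ldots, n\}^d$, and let $\mathcal{C}_3(\Lambda_n)$ be the set of colorings of $\Lambda_n$ that
can be extended to a coloring of $\mathbb{Z}^d$. The topological entropy of $\mathcal{C}_3$ (the set of 3-colorings
of $\mathbb{Z}^d$) is

$$\mathcal{H}^{\rm topo}(\mathcal{C}_3) = \lim_{n \to \infty} \frac{\log|\mathcal{C}_3(\Lambda_n)|}{|\Lambda_n|}.$$

Let $\mu$ be any measure on $(\mathcal{C}_3, \mathcal{F}_{\rm cyl})$ and let $X_n$ be the restriction to $\Lambda_n$ of an element of
$\mathcal{C}_3$ chosen according to $\mu$ (so the range of $X_n$ is a subset of $\mathcal{C}_3(\Lambda_n)$). The measure-theoretic
entropy of $\mathcal{C}_3$ with respect to $\mu$ is

$$\mathcal{H}^{\mu}(\mathcal{C}_3) = \lim_{n \to \infty} \frac{H(X_n)}{|\Lambda_n|}.$$

Note that $\mathcal{H}^{\mu}(\mathcal{C}_3)$ is always at most $\mathcal{H}^{\rm topo}(\mathcal{C}_3)$. We say that $\mu$ is a measure of maximal
entropy if $\mathcal{H}^{\mu}(\mathcal{C}_3) = \mathcal{H}^{\rm topo}(\mathcal{C}_3)$. The sense of measure of maximal entropy is that the
restriction of $\mu$ to any finite subset of $\mathbb{Z}^d$ is supported (asymptotically) on as large a
set as possible. (See e.g. [7] for a more thorough discussion of these topics.)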

We wish to show that the measure $\mu$ of Theorem 1.1 (as described in the introduction) is a
measure of maximal entropy. Fix $m$ and $n$ satisfying $m > n$. Let $\mu_m$ be as described in
the introduction, and let $X_n^m$ be the restriction to $\Lambda_n$ of a coloring chosen according to
$\mu_m$. We will show that

$$H(X_n^m) \ge \log|\mathcal{C}_3(\Lambda_n)| - 2|\partial_{\rm int} \Lambda_n| \log 3. \tag{31}$$

This is enough to show that $\mu$ is a measure of maximal entropy, since $|\partial_{\rm int} \Lambda_n| =
o(\log|\mathcal{C}_3(\Lambda_n)|)$.

Since for any random variable $X$ we have $H(X) \ge -\log \max_x p(x)$, we will have
(31) if we show that, for each $\tau \in \mathcal{C}_3(\Lambda_n)$, we have

$$\Pr(X_n^m = \tau) \le \frac{3^{2|\partial_{\rm int} \Lambda_n|}}{|\mathcal{C}_3(\Lambda_n)|}. \tag{32}$$

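We need the following lemma. Here $\Sigma$ is an arbitrary finite bipartite graph with
bipartition $\mathcal{E} \cup \mathcal{O}$.

Lemma 5.1 Fix $\mathcal{E}' \subseteq \mathcal{E}$ and $\mathcal{O}' \subseteq \mathcal{O}$ arbitrarily and let $\mu$ be uniform measure on
$\mathcal{C}_3(\Sigma)$. For any $\mathcal{E}'' \subseteq \mathcal{E} \setminus \mathcal{E}'$, $\mathcal{O}'' \subseteq \mathcal{O} \setminus \mathcal{O}'$,

$$\mu(\chi \equiv 0 \text{ on } \mathcal{E}'' \text{ and } \chi \equiv 1 \text{ on } \mathcal{O}'' \mid \chi \equiv 0 \text{ on } \mathcal{E}' \text{ and } \chi \equiv 1 \text{ on } \mathcal{O}') \ge 3^{-|\mathcal{E}'' \cup \mathcal{O}''|}.$$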

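Proof. We proceed by induction on $|\mathcal{E}'' \cup \mathcal{O}''|$, beginning with the case $|\mathcal{E}'' \cup \mathcal{O}''| = 1$.
Without loss of generality, we may take $\mathcal{O}'' = \emptyset$ and $\mathcal{E}'' = \{x\}$ for some $x \in \mathcal{E} \setminus \mathcal{E}'$.
Write $C'$ for the set of those $\chi$ satisfying $\chi|_{\mathcal{E}'} \equiv 0$ and $\chi|_{\mathcal{O}'} \equiv 1$ and, for $i \in \{0, 1, 2\}$,
write $C_i'$ for $\{\chi \in C' : \chi(x) = i\}$. We wish to show that $|C_0'|/|C'| \ge 1/3$, for which (by
1-2 symmetry) it is enough to show $|C_1'| \le |C_0'|$.

To verify this last inequality, consider the following map from $C_1'$ to $C_0'$: for $\chi \in C_1'$
let $C$ be the set of vertices in $\Sigma$ reachable from $x$ using only vertices colored 0 and
1, and let $\chi'$ be obtained from $\chi$ by interchanging 0 and 1 on $C$. We must have
$C \cap (\mathcal{E}' \cup \mathcal{O}') = \emptyset$ (since otherwise we would have an odd path from $x$ to $\mathcal{E}'$ or an even
path from $x$ to $\mathcal{O}'$), so that in fact $\chi' \in C_0'$. Moreover, the map is injective since we can
recover $\chi$ by interchanging 0 and 1 on the set of vertices in $\Sigma$ reachable from $x$ using
only vertices colored 0 and 1 (under $\chi'$).

For the induction step, consider the case $|\mathcal{E}'' \cup \mathcal{O}''| = t > 1$ where without loss of
generality $|\mathcal{E}''| > 0$. Fix $x \in \mathcal{E}''$. We have

$$\mu(\chi \equiv 0 \text{ on } \mathcal{E}'' \text{ and } \chi \equiv 1 \text{ on } \mathcal{O}'' \mid \chi \equiv 0 \text{ on } \mathcal{E}' \text{ and } \chi \equiv 1 \text{ on } \mathcal{O}') =$$
$$\mu(\chi \equiv 0 \text{ on } \mathcal{E}'' \setminus \{x\} \text{ and } \chi \equiv 1 \text{ on } \mathcal{O}'' \mid \chi \equiv 0 \text{ on } \mathcal{E}' \text{ and } \chi \equiv 1 \text{ on } \mathcal{O}') \times$$
$$\mu(\chi(x) = 0 \mid \chi \equiv 0 \text{ on } \mathcal{E}' \cup (\mathcal{E}'' \setminus \{x\}) \text{ and } \chi \equiv 1 \text{ on } \mathcal{O}' \cup \mathcal{O}'').$$

The first term in the product above is at least $3^{-(t-1)}$ (by induction), and the second
term is at least 1/3 (it is another instance of the base case), so the product is at least
$3^{-t}$. □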
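
Lemma 5.1 is easy to test by brute force on a small graph. The Python sketch below enumerates all proper 3-colorings of the 6-cycle and computes the conditional probability in the lemma exactly, using rational arithmetic. The function names and the particular choice of $\mathcal{E}'$, $\mathcal{O}'$, $\mathcal{E}''$, $\mathcal{O}''$ are illustrative assumptions of ours; exhaustive enumeration is only feasible for very small graphs.

```python
from fractions import Fraction
from itertools import product


def proper_colorings(vertices, edges, q=3):
    """Yield every proper q-coloring of the graph as a dict vertex -> color."""
    for colors in product(range(q), repeat=len(vertices)):
        chi = dict(zip(vertices, colors))
        if all(chi[u] != chi[v] for u, v in edges):
            yield chi


def conditional_probability(vertices, edges, E1, O1, E2, O2):
    """Probability, under the uniform measure on proper 3-colorings, that
    chi = 0 on E2 and chi = 1 on O2, given chi = 0 on E1 and chi = 1 on O1."""
    given = hits = 0
    for chi in proper_colorings(vertices, edges):
        if all(chi[v] == 0 for v in E1) and all(chi[v] == 1 for v in O1):
            given += 1
            if all(chi[v] == 0 for v in E2) and all(chi[v] == 1 for v in O2):
                hits += 1
    return Fraction(hits, given)


# The 6-cycle is bipartite with even side {0, 2, 4} and odd side {1, 3, 5}.
cycle = list(range(6))
edges = [(i, (i + 1) % 6) for i in range(6)]
prob = conditional_probability(cycle, edges, E1={0}, O1={3}, E2={2}, O2={5})
```

In this instance the conditional probability works out to $4/9$, comfortably above the bound $3^{-2} = 1/9$ given by the lemma.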

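Now let $\Lambda = W_m \setminus (\Lambda_n \setminus \partial_{\rm int} \Lambda_n)$ (recall from Section 1 that $W_m$ is the box
$\{-m, \ldots, m\}^d$ together with all of the odd vertices of the box $\{-(m+1), \ldots, m+1\}^d$).
For $\tau \in \mathcal{C}_3(\Lambda_n)$, let $N(\tau)$ be the number of $\chi \in \mathcal{C}_3(\Lambda)$ that agree with $\tau$ on $\Lambda_n$ and can
be extended to colorings in ${\rm supp}(\mu_m) := \{\chi : \mu_m(\chi) > 0\}$. Thus $N(\tau)$ depends
only on the restriction of $\tau$ to $\partial_{\rm int} \Lambda_n$, and $N(\tau) = \Pr(X_n^m = \tau)|{\rm supp}(\mu_m)|$.

Set

$$\mathcal{C}^*(\Lambda_n) = \{\tau_0 \in \mathcal{C}_3(\Lambda_n) : \tau_0 \equiv 0 \text{ on } (\partial_{\rm int} \Lambda_n) \cap \mathcal{O} \text{ and } \tau_0 \equiv 1 \text{ on } (\partial_{\rm int} \Lambda_n) \cap \mathcal{E}\}.$$

By Lemma 5.1 (with $\Sigma = \Lambda \cup \partial_{\rm ext} W_m$, $\mathcal{E}' \cup \mathcal{O}' = \partial_{\rm ext} W_m$ and $\mathcal{E}'' \cup \mathcal{O}'' = \partial_{\rm int} \Lambda_n$) we
have, for any $\tau_0 \in \mathcal{C}^*(\Lambda_n)$ and $\tau \in \mathcal{C}_3(\Lambda_n)$,

$$N(\tau_0) \ge 3^{-|\partial_{\rm int} \Lambda_n|} N(\tau) = 3^{-|\partial_{\rm int} \Lambda_n|} \Pr(X_n^m = \tau)|{\rm supp}(\mu_m)|. \tag{33}$$

Another application of Lemma 5.1 (with $\Sigma = \Lambda_n$, $\mathcal{E}' \cup \mathcal{O}' = \emptyset$ and $\mathcal{E}'' \cup \mathcal{O}'' = \partial_{\rm int} \Lambda_n$)
yields

$$\frac{|\mathcal{C}^*(\Lambda_n)|}{|\mathcal{C}_3(\Lambda_n)|} \ge 3^{-|\partial_{\rm int} \Lambda_n|}$$

and so, since $N(\tau_0)/|{\rm supp}(\mu_m)| = \Pr(X_n^m = \tau_0) \le |\mathcal{C}^*(\Lambda_n)|^{-1}$, we get

$$N(\tau_0) \le \frac{3^{|\partial_{\rm int} \Lambda_n|}}{|\mathcal{C}_3(\Lambda_n)|} |{\rm supp}(\mu_m)|. \tag{34}$$

Combining (33) and (34) we get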